Abstract. In this paper, we embed a metric space endowed with a convex combination operation, called a convex combination space, into a Banach space in such a way that the embedding preserves both the metric and the convex combination structure. Applications of the embedding to random elements taking values in this kind of space are also established. On the one hand, some nice properties of expectation, such as the representation of the expected value through continuous affine mappings and the linearity of expectation, are given. On the other hand, the notion of conditional expectation is introduced and discussed. Thanks to the embedding theorem, we establish some basic properties of conditional expectation, Jensen's inequality, convergence of martingales and an ergodic theorem.

1 Introduction

Probability theory in linear spaces has long been studied and has been extended to more general, nonlinear models, such as hyperspaces of linear spaces or metric spaces in general. Basic objects such as the expectation and the conditional expectation of a random element taking values in a metric space have also attracted the attention of many researchers. Probably the first author to introduce a concept of mathematical expectation of a random element with values in a metric space was Doss [5] in 1949. Since then, other authors have given many different definitions of expectation and conditional expectation in different kinds of metric spaces in various ways. We can mention the works of Émery and Mokobodzki [6], Herer [9, 10, 11], Raynaud de Fitte [16], Sturm [4, 19], and the monograph of Molchanov [12].

In 2006, Terán and Molchanov [21] introduced the concept of convex combination space. The class of these spaces is larger than not only the class of Banach spaces but also the class of hyperspaces of compact subsets, as well as the class of spaces of upper semicontinuous functions (also called fuzzy sets) with compact support in a Banach space [21]. The authors also provided many interesting illustrative examples of this concept, e.g., the space of all cumulative distribution functions or the space of upper semicontinuous functions with f-norm. A convex combination space is a metric space endowed with a convex combination operation, and the extension from linear spaces to convex combination spaces is not trivial. Some very basic sets, such as singletons and balls, may fail to be convex in a convex combination space. This may not match the usual intuition, but it occurs in many practical models. For example, consider the hyperspace of all compact subsets of a Banach space with the convex combinations generated by Minkowski addition and scalar multiplication. Then $\lambda A + (1-\lambda)A$ does not equal $A$ unless $A$ is convex, which means that $\{A\}$ is a non-convex singleton in such a space. Another example is the space of integrable probability distributions, where the convex combinations are generated by the convolution operation (see [21, 22]). For a random element taking values in a convex combination space, the expected value was constructed by Terán and Molchanov. This notion of expectation extends the corresponding ones in Banach spaces and in hyperspaces of compact subsets. Furthermore, the authors also established the Etemadi strong law of large numbers (SLLN) for normalized sums of pairwise independent, identically distributed (i.i.d.) random elements in this kind of space ([21], Theorem 5.1); other applications can be found in [18, 22, 23].
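
The non-convex singleton phenomenon described above can be checked by hand on a two-point set. The following minimal Python sketch computes $\lambda A + (1-\lambda)A$ for $A = \{0, 1\} \subset \mathbb{R}$ with exact rational arithmetic. The helper name `minkowski_combination` is ours and is not taken from [21]; it only handles finite subsets of the real line.

```python
from fractions import Fraction

def minkowski_combination(weights, sets):
    """Return the Minkowski combination sum_i w_i * A_i of finite subsets of R.

    Hypothetical helper for illustration: each A_i is a finite set of rationals.
    """
    result = {Fraction(0)}
    for w, A in zip(weights, sets):
        # Minkowski sum of the partial result with the scaled set w * A
        result = {r + w * a for r in result for a in A}
    return result

A = {Fraction(0), Fraction(1)}
half = Fraction(1, 2)
combo = minkowski_combination([half, half], [A, A])
print(sorted(combo))   # the combination contains 1/2, which is not in A
print(combo == A)      # False, so the singleton {A} is not convex
```

Replacing `A` by a one-point set gives back the same set, in line with the statement that the combination reproduces $A$ when $A$ is convex.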

Although a convex combination space may contain many non-convex singletons, it always contains a subspace (which we will call the convexifiable domain) in which all singletons and balls are convex; moreover, the authors of [21] showed that this subspace has some properties resembling linearity. Therefore, it is natural to ask whether this convexifiable domain can be embedded isometrically into some normed linear space in such a way that the structure of convex combination is preserved. It is worth noting that the expectation of every integrable random element taking values in a convex combination space always belongs to this convexifiable domain. Therefore, if such an embedding is established, we will have more tools to explore this type of expectation as well as the properties of convex combination spaces.

In this paper, we answer the question mentioned above. Namely, we show that the convexifiable domain of a complete convex combination space can be embedded into a Banach space in such a way that the embedding is isometric and the structure of convex combination is preserved. This is presented in Section 3.

The main applications of the embedding approach are presented in Section 4. On the one hand, some nice properties of expectation are given, including the representation of the expected value through continuous affine mappings and Jensen's inequality (first proved by Terán [22] and proved again in this work in another way). On the other hand, the notion of conditional expectation of an integrable random element taking values in a convex combination space is introduced and discussed. Thanks to the embedding theorem, we establish some basic properties of conditional expectation, Jensen's inequality, convergence of martingales and an ergodic theorem.

Finally, some miscellaneous applications and remarks will be discussed in Section 5. 


2 Preliminaries 


For the reader's convenience, we now present a short introduction to the approach given by Terán and Molchanov in [21]. Let $(\mathfrak X,d)$ be a metric space; for $u,x\in\mathfrak X$, we denote $\|x\|_u=d(u,x)$. On $\mathfrak X$, introduce a convex combination operation which, for all $n\ge 2$, all numbers $\lambda_1,\dots,\lambda_n>0$ that satisfy $\sum_{i=1}^n\lambda_i=1$, and all $u_1,\dots,u_n\in\mathfrak X$, produces an element of $\mathfrak X$, denoted by $[\lambda_1,u_1;\dots;\lambda_n,u_n]$ or $[\lambda_i,u_i]_{i=1}^n$. Note that $[\lambda_1,u_1;\dots;\lambda_n,u_n]$ and the shorthand $[\lambda_i,u_i]_{i=1}^n$ have the same intuitive meaning as the more familiar $\lambda_1u_1+\dots+\lambda_nu_n$ and $\sum_{i=1}^n\lambda_iu_i$, but $\mathfrak X$ is not assumed to have any addition or multiplication. Suppose that $[1,u]=u$ for every $u\in\mathfrak X$ and that the following properties are satisfied:

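(CC.i) (Commutativity) $[\lambda_i,u_i]_{i=1}^n=[\lambda_{\sigma(i)},u_{\sigma(i)}]_{i=1}^n$ for every permutation $\sigma$ of $\{1,\dots,n\}$;

(CC.ii) (Associativity) $[\lambda_i,u_i]_{i=1}^{n+2}=\Big[\lambda_1,u_1;\dots;\lambda_n,u_n;\lambda_{n+1}+\lambda_{n+2},\Big[\frac{\lambda_{n+j}}{\lambda_{n+1}+\lambda_{n+2}},u_{n+j}\Big]_{j=1}^{2}\Big]$;

(CC.iii) (Continuity) if $u,v\in\mathfrak X$ and $\lambda^{(k)}\to\lambda\in(0;1)$ as $k\to\infty$, then $[\lambda^{(k)},u;1-\lambda^{(k)},v]\to[\lambda,u;1-\lambda,v]$;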

(CC.iv) (Negative curvature) if $u_1,u_2,v_1,v_2\in\mathfrak X$ and $\lambda\in(0;1)$, then

$$d([\lambda,u_1;1-\lambda,u_2],[\lambda,v_1;1-\lambda,v_2])\le\lambda d(u_1,v_1)+(1-\lambda)d(u_2,v_2);$$

By induction and (CC.ii), this axiom can be extended to convex combinations of $n$ elements, as follows: if $u_i,v_i\in\mathfrak X$, $\lambda_i\in(0;1)$ with $\sum_{i=1}^n\lambda_i=1$, then $d([\lambda_i,u_i]_{i=1}^n,[\lambda_i,v_i]_{i=1}^n)\le\sum_{i=1}^n\lambda_id(u_i,v_i)$.

(CC.v) (Convexification) for each $u\in\mathfrak X$, there exists $\lim_{n\to\infty}[n^{-1},u]_{i=1}^n$, which will be denoted by $K_{\mathfrak X}u$ (or $Ku$ when no confusion can arise), and $K$ is called the convexification operator.

Then, the metric space $(\mathfrak X,d)$ endowed with a convex combination operation is referred to as a convex combination space (CC space for short), and we denote it by $(\mathfrak X,d,[\cdot,\cdot])$ or simply $\mathfrak X$. Axiom (CC.v) shows that $[n^{-1},u]_{i=1}^n$ is different from $u$ in general, so $Ku$ and $u$ may not coincide. If $Ku=u$, then $u$ is called a convex point of $\mathfrak X$, and the subspace $K(\mathfrak X)$ is called the convexifiable domain. If $K(\mathfrak X)=\mathfrak X$, then $\mathfrak X$ is said to be convexifiable, and $[\cdot,\cdot]$ is then called an unbiased convex combination operation. Conditions (CC.i)-(CC.v) above imply the following properties:

(2.1) ([21], Lemma 2.1) For every $u_{11},\dots,u_{mn}\in\mathfrak X$ and $\alpha_1,\dots,\alpha_m,\beta_1,\dots,\beta_n>0$ with $\sum_{i=1}^m\alpha_i=\sum_{j=1}^n\beta_j=1$, we have $[\alpha_i,[\beta_j,u_{ij}]_{j=1}^n]_{i=1}^m=[\alpha_i\beta_j,u_{ij}]_{i=1,j=1}^{i=m,j=n}$.

(2.2) ([21], Lemma 2.2) The convex combination operation is jointly continuous in its 2n arguments.

(2.3) ([21], Proposition 3.1) The convexification operator $K$ is linear, that is, $K([\lambda_j,u_j]_{j=1}^n)=[\lambda_j,Ku_j]_{j=1}^n$.

(2.4) ([21], Corollary 3.3) If $u\in\mathfrak X$ and $\lambda_1,\dots,\lambda_n>0$ with $\sum_{j=1}^n\lambda_j=1$, then $K([\lambda_j,u]_{j=1}^n)=Ku=[\lambda_j,Ku]_{j=1}^n$. Hence, $K$ is an idempotent operator in $\mathfrak X$.

(2.5) ([21], Proposition 3.5) For $\lambda_1,\lambda_2,\lambda_3>0$ with $\lambda_1+\lambda_2+\lambda_3=1$ and $u,v\in\mathfrak X$,

$$[\lambda_1,u;\lambda_2,Kv;\lambda_3,Kv]=[\lambda_1,u;(\lambda_2+\lambda_3),Kv].$$

(2.6) ([21], Proposition 3.6) The mapping $K$ is non-expansive with respect to the metric $d$, i.e., $d(Ku,Kv)\le d(u,v)$.

Remark 1. Let $\lambda_k\in(0;1)$, $\lambda_k\to0$ and $u,v\in\mathfrak X$. By (CC.iv) and property (2.4), we have

$$d([\lambda_k,Ku;1-\lambda_k,Kv],Kv)=d([\lambda_k,Ku;1-\lambda_k,Kv],[\lambda_k,Kv;1-\lambda_k,Kv])\le\lambda_kd(Ku,Kv)\to0$$

as $k\to\infty$. It follows that $[\lambda_k,Ku;1-\lambda_k,Kv]\to Kv$, and this remark allows us to extend the weights $\lambda_i$ from $(0;1)$ to $[0;1]$ for elements of $K(\mathfrak X)$; that is, we can define $[\lambda_i,x_i]_{i\in I}=[\lambda_i,x_i]_{i\in J}$, where $x_i\in K(\mathfrak X)$, $\sum_{i\in I}\lambda_i=\sum_{i\in J}\lambda_i=1$, $J=\{i\in I:\lambda_i>0\}$.

Proposition 2.1. If $(\mathfrak X,d)$ is a separable and complete CC space, then so is $(K(\mathfrak X),d)$.

Proof. The separability of $K(\mathfrak X)$ is obvious. It follows from Proposition 3.7 in [21] that $K(\mathfrak X)$ is a closed subset of the complete metric space $\mathfrak X$, hence $K(\mathfrak X)$ is complete. □

3 Embedding theorem


First, we need to recall the embedding for convex structures given by Świrszcz [20]. In his work, Świrszcz introduced the notion of a semiconvex set as follows: a semiconvex set is a set $S$ together with a family of binary operations $\{P_\lambda:S\times S\to S,\ \lambda\in(0;1)\}$ satisfying the following axioms: for $x,y,z\in S$ and $\lambda,\mu\in(0;1)$, (S.i) (Reflexivity) $P_\lambda(x,x)=x$; (S.ii) (Symmetry) $P_\lambda(x,y)=P_{1-\lambda}(y,x)$; (S.iii) (Associativity) $P_r(P_\lambda(x,y),z)=P_{r\lambda}(x,P_\mu(y,z))$ for $r=\mu/(1-\lambda+\lambda\mu)$. Sometimes, for completeness, we also include the binary identity functions $P_1$ and $P_0$ defined by $P_1(x,y)=x$ and $P_0(x,y)=y$. Then (S.i) and (S.ii) hold for $\lambda\in[0;1]$, and (S.iii) holds whenever $\lambda(1-\mu)\ne1$. Also in [20], the author showed that a semiconvex set $S$ can be embedded as a convex subset of a vector space if and only if it satisfies the cancellation law, that is, $P_r(x,y)=P_r(x,z)$ for some $x,y,z\in S$, $r\in(0;1)$ implies that $y=z$. Therefore, if the cancellation law holds in $S$, then there exist a vector space $(V,+,\cdot)$ and a one-to-one correspondence $\rho:S\to\rho(S)=U\subset V$ such that $\rho(P_\lambda(x,y))=\lambda\rho(x)+(1-\lambda)\rho(y)$ for all $x,y\in S$, $\lambda\in[0;1]$. For more details, the reader can refer to [8, 20].
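
The index $r=\mu/(1-\lambda+\lambda\mu)$ in the associativity axiom (S.iii) can be sanity-checked in the simplest semiconvex set, the real line with $P_\lambda(x,y)=\lambda x+(1-\lambda)y$. The sketch below is only an illustration in that special case; the function name `P` and the sample values are our own choices.

```python
from fractions import Fraction

def P(lam, x, y):
    """Binary convex operation P_lam(x, y) = lam*x + (1 - lam)*y on the real line."""
    return lam * x + (1 - lam) * y

lam, mu = Fraction(1, 3), Fraction(2, 5)
r = mu / (1 - lam + lam * mu)          # the index r from axiom (S.iii)
x, y, z = Fraction(7), Fraction(-2), Fraction(11)

lhs = P(r, P(lam, x, y), z)            # P_r(P_lam(x, y), z)
rhs = P(r * lam, x, P(mu, y, z))       # P_{r*lam}(x, P_mu(y, z))
print(r, lhs, rhs, lhs == rhs)
```

Exact rational arithmetic is used so that the equality test is not affected by rounding.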

Proposition 3.1. If $(\mathfrak X,d,[\cdot,\cdot])$ is a CC space, then $K(\mathfrak X)$ is a semiconvex set with $P_\lambda(x,y)=[\lambda,x;1-\lambda,y]$, $x,y\in K(\mathfrak X)$, $\lambda\in[0;1]$.

Proof. It is easy to see that the axioms (S.i), (S.ii) and (S.iii) are implied by property (2.4), (CC.i) and (CC.ii) respectively. □

The following proposition establishes a metric cancellation law in $K(\mathfrak X)$, which plays the key role in obtaining the embedding theorem.

Proposition 3.2. (Metric cancellation law) Let $\mathfrak X$ be a CC space and $x,y,z\in K(\mathfrak X)$, $\lambda\in[0;1]$. Then,

$$d([\lambda,x;1-\lambda,y],[\lambda,x;1-\lambda,z])=(1-\lambda)d(y,z).$$

In particular, the algebraic cancellation law holds, i.e., if $[\lambda,x;1-\lambda,y]=[\lambda,x;1-\lambda,z]$ for some $\lambda\in[0;1)$, then $y=z$.

Proof. If $\lambda=0$ or $\lambda=1$, then the conclusion is trivial. We now consider $\lambda\in(0;1)$.

Step 1. - The first auxiliary result: if $\lambda_k\in(0;1)$ and $\lambda_k\to0$, then $[\lambda_k,u;1-\lambda_k,Kv]\to Kv$ as $k\to\infty$ for $u,v\in\mathfrak X$. This is easy to see, since

$$d([\lambda_k,u;1-\lambda_k,Kv],Kv)=d([\lambda_k,u;1-\lambda_k,Kv],[\lambda_k,Kv;1-\lambda_k,Kv])\le\lambda_kd(u,Kv)\to0$$

as $k\to\infty$.

- The second auxiliary result: if $u,v\in K(\mathfrak X)$, then $d([\lambda,u;1-\lambda,v],u)=(1-\lambda)d(u,v)$ and $d([\lambda,u;1-\lambda,v],v)=\lambda d(u,v)$. Indeed, by (CC.iv) and (2.5),

$$d([\lambda,u;1-\lambda,v],u)=d([\lambda,u;1-\lambda,v],[\lambda,u;1-\lambda,u])\le(1-\lambda)d(u,v),$$
$$d([\lambda,u;1-\lambda,v],v)=d([\lambda,u;1-\lambda,v],[\lambda,v;1-\lambda,v])\le\lambda d(u,v),$$

and by the triangle inequality,

$$d(u,v)\le d([\lambda,u;1-\lambda,v],u)+d([\lambda,u;1-\lambda,v],v)\le(1-\lambda)d(u,v)+\lambda d(u,v)=d(u,v).$$

Thus, $d([\lambda,u;1-\lambda,v],u)=(1-\lambda)d(u,v)$ and $d([\lambda,u;1-\lambda,v],v)=\lambda d(u,v)$.

Step 2. We denote by $m(x,y)=[1/2,x;1/2,y]$ the midpoint of $x,y$; it is easy to see that $m(x,y)$ also belongs to $K(\mathfrak X)$. By (CC.iv) we have

$$d(m(x,y),m(x,z))=d([1/2,x;1/2,y],[1/2,x;1/2,z])\le2^{-1}d(y,z).$$

An ordered set of four points $(x,y,z,t)$ is called a parallelogram (in this order) if $m(x,z)=m(y,t)$. In this step, we will prove that if $(x,y,z,t)$ is a parallelogram then $d(x,y)=d(t,z)$. Without loss of generality, assume that $d(x,y)\ge d(t,z)$. Then it is sufficient to prove that $d(x,y)\le d(t,z)$. Putting $m(x,z)=m(y,t)=m_1$, we have

$$d(m_1,m(y,z))=d(m(y,t),m(y,z))\le2^{-1}d(t,z),$$

$$d(m_1,m(y,z))=d(m(z,x),m(z,y))\le2^{-1}d(x,y).\qquad(3.1)$$

Moreover,

$$m(m(x,t),m(y,z))=[1/2,[1/2,x;1/2,t];1/2,[1/2,y;1/2,z]]=[1/4,x;1/4,y;1/4,z;1/4,t]$$

$$=[1/2,[1/2,x;1/2,z];1/2,[1/2,y;1/2,t]]=[1/2,m_1;1/2,m_1]=m_1,$$

which means that $m_1$ is also the midpoint of $m(x,t)$ and $m(y,z)$. Thus $d(m_1,m(y,z))=2^{-1}d(m(x,t),m(y,z))$ by Step 1. Combining this with (3.1), we obtain

$$d(m(x,t),m(y,z))\le d(t,z)\quad\text{and}\quad d(m(x,t),m(y,z))\le d(x,y).\qquad(3.2)$$

On the other hand,

$$m(x,m(y,z))=[1/2,x;1/2,[1/2,y;1/2,z]]=[1/2,x;1/4,y;1/4,z]=[1/4,x;1/4,y;1/2,m_1]$$

$$=[1/4,x;1/4,y;1/2,[1/2,y;1/2,t]]=[1/4,x;1/2,y;1/4,t]=m(y,m(x,t)),$$

which implies that $(x,y,m(y,z),m(x,t))$ is a parallelogram. Applying (3.2), we obtain

$$d(m^{(2)}(x,t),m^{(2)}(y,z))\le d(m(x,t),m(y,z))\le d(t,z)\quad\text{and}\quad d(m^{(2)}(x,t),m^{(2)}(y,z))\le d(x,y),$$

where $m^{(2)}(x,t)=m(x,m(x,t))=[3/4,x;1/4,t]$ and $m^{(2)}(y,z)=m(y,m(y,z))=[3/4,y;1/4,z]$. Continuing this process, we derive

$$d(m^{(k)}(x,t),m^{(k)}(y,z))\le d(t,z)\quad\text{and}\quad d(m^{(k)}(x,t),m^{(k)}(y,z))\le d(x,y)\quad\text{for all }k\ge3,\qquad(3.3)$$

with $m^{(k)}(x,t)=m(x,m^{(k-1)}(x,t))=[(2^k-1)/2^k,x;1/2^k,t]$ and $m^{(k)}(y,z)=[(2^k-1)/2^k,y;1/2^k,z]$. Letting $k\to\infty$ in (3.3) and applying Step 1 and the continuity of the metric $d$, we obtain $d(x,y)\le d(t,z)$. This completes Step 2.
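
The closed form of the iterated midpoints used in (3.3) can be sanity-checked on the real line, where $m(x,y)=(x+y)/2$. The sketch below is only an illustration in this special case (the function names are ours); it compares the recursion $m^{(k)}(x,t)=m(x,m^{(k-1)}(x,t))$ with $((2^k-1)x+t)/2^k$.

```python
from fractions import Fraction

def midpoint(x, y):
    """Midpoint m(x, y) = [1/2, x; 1/2, y] on the real line."""
    return (x + y) / 2

def iterated_midpoint(x, t, k):
    """m^(k)(x, t), with m^(1) = m(x, t) and m^(k) = m(x, m^(k-1)(x, t))."""
    point = t
    for _ in range(k):
        point = midpoint(x, point)
    return point

x, t = Fraction(1), Fraction(9)
for k in (1, 2, 3, 10):
    closed_form = ((2**k - 1) * x + t) / 2**k
    value = iterated_midpoint(x, t, k)
    print(k, value, value == closed_form)
```

In this special case the distance from $m^{(k)}(x,t)$ to $x$ is $|t-x|/2^k$, which is the kind of convergence that the passage to the limit in Step 2 relies on.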

Step 3. The proof of the proposition is completed in this step. Putting $u=[\lambda,x;1-\lambda,y]$, $v=[\lambda,x;1-\lambda,z]$ and $w=[\lambda,y;1-\lambda,z]$, we get

$$m(u,w)=[1/2,[\lambda,x;1-\lambda,y];1/2,[\lambda,y;1-\lambda,z]]=[\lambda/2,x;1/2,y;(1-\lambda)/2,z],$$

$$m(v,y)=[1/2,[\lambda,x;1-\lambda,z];1/2,y]=[\lambda/2,x;1/2,y;(1-\lambda)/2,z].$$

Thus, $(u,v,w,y)$ is a parallelogram and it follows from Step 2 that $d(u,v)=d(y,w)$. On the other hand, $d(y,w)=d(y,[\lambda,y;1-\lambda,z])=(1-\lambda)d(y,z)$ by Step 1, so $d(u,v)=(1-\lambda)d(y,z)$. The proposition is proved. □
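
As a concrete illustration of the metric cancellation law, consider compact intervals of the real line with Minkowski convex combinations and the Hausdorff metric; these are convex compact sets and hence convex points of the hyperspace of compact subsets. The sketch below assumes the standard facts that $\lambda[a,b]+(1-\lambda)[c,e]=[\lambda a+(1-\lambda)c,\lambda b+(1-\lambda)e]$ and that the Hausdorff distance between $[a,b]$ and $[c,e]$ is $\max(|a-c|,|b-e|)$. The helper names and sample intervals are ours.

```python
from fractions import Fraction

def combine(lam, I, J):
    """Convex combination [lam, I; 1-lam, J] of intervals via Minkowski operations."""
    return (lam * I[0] + (1 - lam) * J[0], lam * I[1] + (1 - lam) * J[1])

def hausdorff(I, J):
    """Hausdorff distance between compact intervals I = (a, b) and J = (c, e)."""
    return max(abs(I[0] - J[0]), abs(I[1] - J[1]))

x = (Fraction(0), Fraction(3))
y = (Fraction(-1), Fraction(2))
z = (Fraction(4), Fraction(9))
lam = Fraction(1, 4)

lhs = hausdorff(combine(lam, x, y), combine(lam, x, z))
rhs = (1 - lam) * hausdorff(y, z)
print(lhs, rhs, lhs == rhs)
```

This is a single numerical instance, not a proof; it only shows the identity of Proposition 3.2 at work in one familiar space.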

Theorem 3.3. Let $(\mathfrak X,d,[\cdot,\cdot])$ be a complete and convexifiable CC space. Then, there exist a Banach space $(\mathbb E,\|\cdot\|)$ and a map $j:\mathfrak X\to\mathbb E$, where $j(\mathfrak X)=\mathbb F$ is a subset of $\mathbb E$, such that
(i) $\mathbb F$ is closed and convex;

(ii) $j([\lambda,x;1-\lambda,y])=\lambda j(x)+(1-\lambda)j(y)$ for every $x,y\in\mathfrak X$, $\lambda\in[0;1]$;

(iii) $d(x,y)=\|j(x)-j(y)\|$ for all $x,y\in\mathfrak X$.

Furthermore, if $\mathfrak X$ is separable then $\mathbb E$ is also separable.
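
For one concrete convexifiable space, an embedding with properties (ii) and (iii) can be written down explicitly. The sketch below takes the compact intervals of the real line with Minkowski combinations and the Hausdorff metric, and maps $[a,b]$ to the pair (midpoint, radius) in $\mathbb R^2$ equipped with the $\ell_1$-norm. This explicit map is our own illustration and is not the construction used in the proof; all names are hypothetical.

```python
from fractions import Fraction

def combine(lam, I, J):
    """Convex combination [lam, I; 1-lam, J] of intervals via Minkowski operations."""
    return (lam * I[0] + (1 - lam) * J[0], lam * I[1] + (1 - lam) * J[1])

def hausdorff(I, J):
    """Hausdorff distance between compact intervals I = (a, b) and J = (c, e)."""
    return max(abs(I[0] - J[0]), abs(I[1] - J[1]))

def j(I):
    """Illustrative embedding: send [a, b] to (midpoint, radius) in R^2."""
    return ((I[0] + I[1]) / 2, (I[1] - I[0]) / 2)

def l1_dist(p, q):
    """Distance induced by the l1-norm on R^2."""
    return abs(p[0] - q[0]) + abs(p[1] - q[1])

y = (Fraction(-1), Fraction(2))
z = (Fraction(4), Fraction(9))
lam = Fraction(1, 4)

jy, jz = j(y), j(z)
isometric = hausdorff(y, z) == l1_dist(jy, jz)
affine = j(combine(lam, y, z)) == (lam * jy[0] + (1 - lam) * jz[0],
                                   lam * jy[1] + (1 - lam) * jz[1])
print(isometric, affine)
```

The image of this map is the half-plane of pairs with non-negative radius, which is closed and convex, in agreement with (i).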

Proof. Since $\mathfrak X$ is convexifiable, it follows from Proposition 3.1, Proposition 3.2 and the result of Świrszcz [20] mentioned above that there exist a vector space $(V,+,\cdot)$ and a one-to-one correspondence $\rho:\mathfrak X\to\rho(\mathfrak X)=U\subset V$ such that $U$ is a convex subset of $V$ and $\rho([\lambda,x;1-\lambda,y])=\lambda\rho(x)+(1-\lambda)\rho(y)$ for all $x,y\in\mathfrak X$, $\lambda\in[0;1]$. By a translation, we can assume without loss of generality that $0:=0_V\in U$, and we denote $\rho^{-1}(0)=\theta\in\mathfrak X$. This ensures that if $u$ belongs to $U$ then $\lambda u$ also belongs to $U$ whenever $\lambda\in[0;1]$; moreover, $\lambda u=\rho([\lambda,x;1-\lambda,\theta])$, where $\rho(x)=u$. The metric structure on $U$ is induced naturally from that of $\mathfrak X$, and we also use the symbol $d$ to denote the metric on $U$. Namely, if $u=\rho(x)$, $v=\rho(y)\in U$, then $d(u,v)=d(\rho(x),\rho(y))=d(x,y)$. Thus, if $(\mathfrak X,d)$ is complete (resp. separable) then $(U,d)$ is also complete (resp. separable). From Proposition 3.2, we have

$$d(\lambda u,\lambda v)=d([\lambda,x;1-\lambda,\theta],[\lambda,y;1-\lambda,\theta])=\lambda d(x,y)=\lambda d(u,v),\quad\text{for }\lambda\in[0;1]\text{ and }u,v\in U.\qquad(3.4)$$

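Let us denote by $\mathbb K=\{\lambda u:u\in U,\lambda\ge0\}$ the subset of $V$ containing $U$. Any $x,y\in\mathbb K$ have the form $x=\alpha u$, $y=\beta v$ with $\alpha,\beta>0$, $u,v\in U$, and then

$$x+y=\alpha u+\beta v=(\alpha+\beta)\Big(\frac{\alpha}{\alpha+\beta}u+\frac{\beta}{\alpha+\beta}v\Big).$$

It follows from the convexity of $U$ that $\frac{\alpha}{\alpha+\beta}u+\frac{\beta}{\alpha+\beta}v\in U$. Hence, $x+y\in\mathbb K$ and $\mathbb K$ is a convex cone of $V$. We define the mapping $d_*:\mathbb K\times\mathbb K\to[0,\infty)$ as follows:

$$d_*(0,0)=d(0,0)=0;$$

$$d_*(x,y)=(\alpha+\beta)\,d\Big(\frac{\alpha}{\alpha+\beta}u,\frac{\beta}{\alpha+\beta}v\Big),\quad\text{for }x=\alpha u,\ y=\beta v,\ \alpha,\beta\ge0,\ \alpha+\beta>0,\ u,v\in U.$$

The mapping $d_*$ is well-defined, i.e., independent of the choice of the representations $\alpha u$ and $\beta v$. To see this, let $x=\alpha'u'$, $y=\beta'v'$ with $\alpha',\beta'\ge0$, $u',v'\in U$; then $\alpha u=\alpha'u'$, $\beta'v'=\beta v$ and, using (3.4) (note that in the degenerate cases $\alpha+\beta=0$ or $\alpha'+\beta'=0$ the proof is trivial),

$$d_*(\alpha u,\beta v)=(\alpha+\beta)\,d\Big(\frac{\alpha}{\alpha+\beta}u,\frac{\beta}{\alpha+\beta}v\Big)$$

$$=(\alpha+\beta+\alpha'+\beta')\,d\Big(\frac{\alpha}{\alpha+\beta+\alpha'+\beta'}u,\frac{\beta}{\alpha+\beta+\alpha'+\beta'}v\Big)$$

$$=(\alpha+\beta+\alpha'+\beta')\,d\Big(\frac{\alpha'}{\alpha+\beta+\alpha'+\beta'}u',\frac{\beta'}{\alpha+\beta+\alpha'+\beta'}v'\Big)$$

$$=(\alpha'+\beta')\,d\Big(\frac{\alpha'}{\alpha'+\beta'}u',\frac{\beta'}{\alpha'+\beta'}v'\Big)=d_*(\alpha'u',\beta'v').$$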

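It is clear that if $(x,y)\in U\times U$ then $d_*(x,y)=d(x,y)$, and (3.4) can be extended for $(x,y,\lambda)$ from $U\times U\times[0;1]$ to $\mathbb K\times\mathbb K\times[0,\infty)$ by

$$d_*(\lambda x,\lambda y)=d_*(\lambda\alpha u,\lambda\beta v)=\lambda(\alpha+\beta)\,d\Big(\frac{\alpha}{\alpha+\beta}u,\frac{\beta}{\alpha+\beta}v\Big)=\lambda d_*(\alpha u,\beta v)=\lambda d_*(x,y),\qquad(3.5)$$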


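for $\lambda\ge0$ and $x,y\in\mathbb K$. We now show that $d_*$ is a metric on $\mathbb K$. Indeed, the symmetry and non-negativity of $d_*$ are clear. If $d_*(x,y)=0$ then $d\big(\frac{\alpha}{\alpha+\beta}u,\frac{\beta}{\alpha+\beta}v\big)=0$, so we obtain $\frac{\alpha}{\alpha+\beta}u=\frac{\beta}{\alpha+\beta}v$. It follows that $\alpha u=\beta v$ and $x=y$. Now for $x=\alpha u$, $y=\beta v$, $z=\gamma w\in\mathbb K$, with $u,v,w\in U$ and $\alpha,\beta,\gamma\ge0$, applying (3.5) we have

$$d_*(x,y)=(\alpha+\beta+\gamma)\,d_*\Big(\frac{\alpha}{\alpha+\beta+\gamma}u,\frac{\beta}{\alpha+\beta+\gamma}v\Big)=(\alpha+\beta+\gamma)\,d\Big(\frac{\alpha}{\alpha+\beta+\gamma}u,\frac{\beta}{\alpha+\beta+\gamma}v\Big)$$

$$\le(\alpha+\beta+\gamma)\,d\Big(\frac{\alpha}{\alpha+\beta+\gamma}u,\frac{\gamma}{\alpha+\beta+\gamma}w\Big)+(\alpha+\beta+\gamma)\,d\Big(\frac{\gamma}{\alpha+\beta+\gamma}w,\frac{\beta}{\alpha+\beta+\gamma}v\Big)$$

$$=d_*(\alpha u,\gamma w)+d_*(\gamma w,\beta v)=d_*(x,z)+d_*(z,y),$$

which is the triangle inequality. On the other hand,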

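$$d_*(x+z,y+z)=d_*(\alpha u+\gamma w,\beta v+\gamma w)$$

$$=2(\alpha+\beta+\gamma)\,d\Big(\frac{\alpha}{2(\alpha+\beta+\gamma)}u+\frac{\gamma}{2(\alpha+\beta+\gamma)}w,\ \frac{\beta}{2(\alpha+\beta+\gamma)}v+\frac{\gamma}{2(\alpha+\beta+\gamma)}w\Big)$$

$$=2(\alpha+\beta+\gamma)\,d\Big(\frac{\alpha}{2(\alpha+\beta+\gamma)}u,\frac{\beta}{2(\alpha+\beta+\gamma)}v\Big)=d_*(x,y),$$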

which means that the metric $d_*$ satisfies the cancellation law in $\mathbb K$. In the degenerate cases, the proofs of the triangle inequality and the cancellation law are easy and we omit them. Applying Rådström's embedding theorem ([15], Theorem 1), there exist a real normed linear space $(\mathbb B,\|\cdot\|)$ and a map $\tilde j:\mathbb K\to\tilde j(\mathbb K)=W\subset\mathbb B$ such that: (a) $\tilde j(\lambda x+\mu y)=\lambda\tilde j(x)+\mu\tilde j(y)$ for $x,y\in\mathbb K$ and $\lambda,\mu\ge0$; (b) $d_*(x,y)=\|\tilde j(x)-\tilde j(y)\|$ for all $x,y\in\mathbb K$; (c) $W$ is a convex cone of $\mathbb B$. Moreover, we can choose the normed linear space to be complete, i.e., $\mathbb B$ is a Banach space (if necessary, we replace $\mathbb B$ by its completion and embed $\mathbb K$ into it). It is not hard to check that $\tilde j(U)$ is a convex subset of $\mathbb B$, complete under the metric induced by the norm of $\mathbb B$. Putting $j=\tilde j\circ\rho:\mathfrak X\to\mathbb B$ and $\mathbb F=j(\mathfrak X)$, we find that $\mathbb F$ is a closed, convex subset of $\mathbb B$; moreover, $j(\theta)=0$. Define $\mathbb E$ to be the closed linear subspace of $\mathbb B$ generated by $\mathbb F$. It is easy to check that $\mathbb E$ is a Banach space and that the conclusions (i), (ii), (iii) of the theorem hold. Finally, if $\mathfrak X$ is separable, then so is $\mathbb F$, which implies the separability of $\mathbb E$. This completes the proof. □

In 2011, Brown [2] introduced the notion of convex-like structure in a metric space, and it was suitably restated in [3] as follows. Let $(\mathfrak X,d)$ be a complete metric space. Take $\mathfrak X^n=\mathfrak X\times\dots\times\mathfrak X$ to be the $n$-fold Cartesian product and $\mathrm{Prob}_n$ the set of probability measures on the $n$-element set $\{1,2,\dots,n\}$ endowed with the $\ell_1$-metric $\|\mu-\nu\|=\sum_{i=1}^n|\mu(i)-\nu(i)|$. We say that $(\mathfrak X,d)$ has a convex-like structure if for every $n\in\mathbb N$ and $\mu\in\mathrm{Prob}_n$ there is given a continuous map $\gamma_\mu:\mathfrak X^n\to\mathfrak X$ such that

($\gamma$.1) $\gamma_\mu(x_1,\dots,x_n)=\gamma_{\mu\circ\sigma}(x_{\sigma(1)},\dots,x_{\sigma(n)})$ for every permutation $\sigma$ of $\{1,\dots,n\}$;

($\gamma$.2) if $x_1=x_2$, then $\gamma_\mu(x_1,x_2,\dots,x_n)=\gamma_\nu(x_1,x_3,\dots,x_n)$, where $\nu\in\mathrm{Prob}_{n-1}$ is given by $\nu(1)=\mu(1)+\mu(2)$ and $\nu(j)=\mu(j+1)$, $2\le j\le n-1$;

($\gamma$.3) if $\mu(i)=1$, then $\gamma_\mu(x_1,\dots,x_n)=x_i$;

($\gamma$.4) $d(\gamma_\mu(x_1,\dots,x_n),\gamma_\mu(y_1,\dots,y_n))\le\sum_{i=1}^n\mu(i)d(x_i,y_i)$ for all $y_1,\dots,y_n\in\mathfrak X$;

($\gamma$.5) for all $\mu_1\in\mathrm{Prob}_n$, $\mu_2\in\mathrm{Prob}_m$, $\nu\in\mathrm{Prob}_2$, we have $\gamma_\nu(\gamma_{\mu_1}(x_1,\dots,x_n),\gamma_{\mu_2}(y_1,\dots,y_m))=\gamma_\eta(x_1,\dots,x_n,y_1,\dots,y_m)$, where $\eta\in\mathrm{Prob}_{n+m}$ is given by $\eta(i)=\nu(1)\mu_1(i)$, $1\le i\le n$, and $\eta(j+n)=\nu(2)\mu_2(j)$, $1\le j\le m$.

Proposition 3.4. Let $(\mathfrak X,d)$ be a complete metric space. Then, $\mathfrak X$ is a convexifiable CC space if and only if $\mathfrak X$ has a convex-like structure. In other words, convexifiable CC spaces and metric spaces with a convex-like structure are the same objects.

Proof. When a convex-like structure and a convex combination operation on $\mathfrak X$ determine each other by the identity

$$\gamma_\mu(x_1,\dots,x_n)=[\mu(1),x_1;\dots;\mu(n),x_n]\quad\text{for }\mu\in\mathrm{Prob}_n,$$

the axioms ($\gamma$.1) and ($\gamma$.4) are equivalent to the axioms (CC.i) and (CC.iv) respectively.

- Suppose that $\mathfrak X$ is a convexifiable CC space. Then the axioms ($\gamma$.2), ($\gamma$.3), ($\gamma$.5) follow from (2.5), Remark 1 and (2.1) respectively. Hence $\mathfrak X$ has a convex-like structure.

- Suppose that $\mathfrak X$ has a convex-like structure. Then, the axiom (CC.ii) follows from ($\gamma$.5); axiom (CC.v) is satisfied thanks to ($\gamma$.2), and in this case the operation $[\cdot,\cdot]$ is unbiased. For $\mathfrak X$ to be a convexifiable CC space, it remains to check the axiom (CC.iii). Namely, for $u,v\in\mathfrak X$ and $\lambda_k\to\lambda\in(0;1)$, we need to prove that $\gamma_{\lambda_k,1-\lambda_k}(u,v)\to\gamma_{\lambda,1-\lambda}(u,v)$ as $k\to\infty$, where $\gamma_{\lambda,1-\lambda}$ is a convenient notation for $\gamma_\mu$ with $\mu\in\mathrm{Prob}_2$, $\mu(1)=\lambda$, $\mu(2)=1-\lambda$. For $0<\alpha\le\beta<1$,

$$d(\gamma_{\alpha,1-\alpha}(u,v),\gamma_{\beta,1-\beta}(u,v))=d(\gamma_\eta(u,v,v),\gamma_\eta(u,u,v))\quad\text{(by ($\gamma$.2) with }\eta(1)=\alpha,\ \eta(2)=\beta-\alpha,\ \eta(3)=1-\beta)$$

$$\le(\beta-\alpha)d(u,v)\quad\text{(by ($\gamma$.4))}.$$

Interchanging the roles of $\alpha$ and $\beta$, we obtain $d(\gamma_{\alpha,1-\alpha}(u,v),\gamma_{\beta,1-\beta}(u,v))\le|\beta-\alpha|d(u,v)$ for $\alpha,\beta\in(0;1)$. Applying this inequality, we have (CC.iii). □

Remark 2. After all the proofs in this paper were completed, we learned of the notion of convex-like structure from Tobias Fritz and became aware that a result similar to Theorem 3.3 had been established earlier by Capraro and Fritz in [3]. In their work, they proved that a convex-like structure is affinely and isometrically isomorphic to a closed convex subset of a Banach space ([3], Theorem 9). Combining this result with Proposition 3.4 above, a convexifiable CC space can also be embedded into a Banach space. However, the embedding scheme in our proof is slightly different from theirs, since our final goal is to apply Rådström's result. More specifically, the scheme of [3], Theorem 9 is: convex-like structure on $\mathfrak X$ → establish the algebraic cancellation law → embed $\mathfrak X$ into a vector space (by Stone's embedding) → prove the translation invariance of the metric → extend the metric to the affine hull and then to the whole vector space, which becomes a Banach space; while the scheme of Theorem 3.3 is: convexifiable CC space $\mathfrak X$ → establish the metric cancellation law and, as its corollary, obtain the algebraic cancellation law → embed $\mathfrak X$ into a vector space (by Świrszcz's embedding) → construct a convex cone containing $\mathfrak X$ and a metric on this cone → embed into a Banach space (by Rådström's embedding). Therefore, we still present Theorem 3.3 as an independent rediscovery of Theorem 9 in [3].

4 Applications 

Throughout Section 4 and Section 5, $(\Omega,\mathcal A,P)$ is a complete probability space without atoms; for $A\in\mathcal A$, the notation $I(A)$ (or $I_A$) denotes the indicator function of $A$.

Suppose that $(\mathfrak X,d)$ is a metric space and $\mathcal G$ is a sub-$\sigma$-algebra of $\mathcal A$. A mapping $X:\Omega\to\mathfrak X$ is said to be $\mathcal G$-measurable if $X^{-1}(B)\in\mathcal G$ for all $B\in\mathcal B(\mathfrak X)$, where $\mathcal B(\mathfrak X)$ is the Borel $\sigma$-algebra on $\mathfrak X$. An $\mathcal A$-measurable mapping will be called a random element, and when a random element $X$ takes finitely many values in $\mathfrak X$, it is called a simple random element. A random element $X:\Omega\to\mathfrak X$ is said to be $p$-order integrable ($p>0$) if $d^p(u,X)$ is an integrable real-valued random variable for some $u\in\mathfrak X$; when $p=1$, $X$ is simply said to be integrable. Note that this definition does not depend on the choice of the element $u$. The space (of equivalence classes) of all $\mathcal G$-measurable, $p$-order integrable random elements in $\mathfrak X$ will be denoted by $L^p_{\mathfrak X}(\mathcal G)$. We also use $L^p_{\mathfrak X}$ to denote $L^p_{\mathfrak X}(\mathcal A)$, and the metric on $L^p_{\mathfrak X}$ is defined by $\Delta_p(X,Y)=(Ed^p(X,Y))^{1/p}$, $p\ge1$.

The distribution $P_X$ of an $\mathfrak X$-valued random element $X$ is defined by $P_X(B)=P(X^{-1}(B))$, $\forall B\in\mathcal B(\mathfrak X)$, and two $\mathfrak X$-valued random elements $X,Y$ are said to be identically distributed if $P_X=P_Y$. The collection of $\mathfrak X$-valued random elements $\{X_i,i\in I\}$ is said to be independent (resp. pairwise independent) if the collection of $\sigma$-algebras $\{\sigma(X_i),i\in I\}$ is independent (resp. pairwise independent), where $\sigma(X)=\{X^{-1}(B),B\in\mathcal B(\mathfrak X)\}$.

Next, we recall some notions introduced by Terán and Molchanov [21]. Assume that $(\mathfrak X,d)$ is a separable and complete CC space. For a simple random element $X=[I(\Omega_i),x_i]_{i=1}^n$ (that is, $X=x_i$ on $\Omega_i$, where $\Omega_1,\dots,\Omega_n$ form a measurable partition of $\Omega$), the expectation of $X$ is defined by $EX=[P(\Omega_i),Kx_i]_{i=1}^n$.

It is easy to prove that if $X,Y$ are simple random elements, then $d(EX,EY)\le Ed(X,Y)$.

We fix $u_0\in K(\mathfrak X)$ (by (CC.v), $K(\mathfrak X)\ne\emptyset$), and $u_0$ will be considered as the special element of $\mathfrak X$. Since the metric space $\mathfrak X$ is separable, there exists a countable dense subset $\{u_j,j\ge1\}$ of $\mathfrak X$. For each $n\ge1$, we define the mapping $\psi_n:\mathfrak X\to\mathfrak X$ by $\psi_n(x)=u_{m_n(x)}$, where $m_n(x)$ is the smallest $i\in\{0,\dots,n\}$ such that $d(u_i,x)=\min_{0\le j\le n}d(u_j,x)$.

Then, $d(u_0,\psi_n(x))\le2d(u_0,x)$ for all $n$ and all $x\in\mathfrak X$.

Since $\mathfrak X$ is separable and complete, an integrable $\mathfrak X$-valued random element can be approximated by a sequence of simple random elements. Namely, for $X\in L^1_{\mathfrak X}$, we have $X=\lim_{n\to\infty}\psi_n(X)$, and the expectation of $X$ is defined by $EX=\lim_{n\to\infty}E\psi_n(X)$. By the approximation method, one also proves that if $X,Y\in L^1_{\mathfrak X}$, then $d(EX,EY)\le Ed(X,Y)$.

A set $A\subset\mathfrak X$ is called convex if $[\lambda_i,u_i]_{i=1}^n\in A$ for all $u_i\in A$ and all positive numbers $\lambda_i$ that sum to 1. For $A\subset\mathfrak X$, we denote by $\mathrm{co}A$ the convex hull of $A$, which is the smallest convex subset containing $A$, and by $\overline{\mathrm{co}}A$ the closure of $\mathrm{co}A$ in $\mathfrak X$. Let $k(\mathfrak X)$ (resp. $ck(\mathfrak X)$) be the set of nonempty compact (resp. convex compact) subsets of $\mathfrak X$ and denote by $D_{\mathfrak X}$ the Hausdorff metric on $k(\mathfrak X)$, that is, $D_{\mathfrak X}(A,B)=\max\{\sup_{a\in A}\inf_{b\in B}d(a,b),\sup_{b\in B}\inf_{a\in A}d(b,a)\}$ for $A,B\in k(\mathfrak X)$. It follows from Theorem 6.2 of [21] that if $\mathfrak X$ is a separable complete CC space, then the space $k(\mathfrak X)$ with the convex combination

$$[\lambda_i,A_i]_{i=1}^n=\{[\lambda_i,u_i]_{i=1}^n:u_i\in A_i\text{ for all }i\}$$

and the Hausdorff metric $D_{\mathfrak X}$ is a separable complete CC space, where the convexification operator $K_{k(\mathfrak X)}$ is given by

$$K_{k(\mathfrak X)}A=\overline{\mathrm{co}}K_{\mathfrak X}(A)=\overline{\mathrm{co}}\{K_{\mathfrak X}u:u\in A\}.$$

This is a nice feature of CC spaces. Based on this property, if a result holds for elements of a CC space, then it can be lifted to the space of nonempty compact subsets. In addition, $K_{k(\mathfrak X)}(k(\mathfrak X))=ck(K_{\mathfrak X}(\mathfrak X))$ by Proposition 5.1 in the next section. Further details can be found in [21].
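
To see how the definition of the expectation of a simple random element interacts with the convexification operator on a hyperspace, consider random finite subsets of the real line. The sketch below assumes that, for such sets, the convexification is the convex hull (an interval) and that a Minkowski combination of intervals is computed endpoint by endpoint. The helper names and the sample distribution are ours.

```python
from fractions import Fraction

def convexify(A):
    """K for a finite subset of R: its convex hull, stored as an interval (min, max)."""
    return (min(A), max(A))

def expectation_simple(probs, values):
    """E X = [P(Omega_i), K x_i]: Minkowski-weighted sum of the convexified values."""
    lo = sum(p * convexify(A)[0] for p, A in zip(probs, values))
    hi = sum(p * convexify(A)[1] for p, A in zip(probs, values))
    return (lo, hi)

# X equals {0, 1} with probability 1/2 and {2} with probability 1/2
probs = [Fraction(1, 2), Fraction(1, 2)]
values = [{Fraction(0), Fraction(1)}, {Fraction(2)}]
print(expectation_simple(probs, values))
```

The result is an interval even though one of the values of the random set is not convex, which matches the remark that expectations always lie in the convexifiable domain.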

From now until the end of the paper, we always assume that $(\mathfrak X,d)$ is a separable and complete CC space. Proposition 2.1 implies that $(K(\mathfrak X),d)$ is also a separable, complete and convexifiable CC space. Therefore, it follows from Theorem 3.3 that $K(\mathfrak X)$ can be embedded isometrically as a closed, convex subset of a separable Banach space $\mathbb E$ via the mapping $j$. Moreover, if $X$ is an integrable $\mathfrak X$-valued random element, then $KX$ is an integrable $K(\mathfrak X)$-valued random element.

4.1 On some properties of expectation 

Theorem 4.1. Let $X$ be an integrable $\mathfrak X$-valued random element. Then, $j(EX)=j(E(KX))=Ej(KX)$, where $j:K(\mathfrak X)\to\mathbb E$ is the mapping mentioned in Theorem 3.3 and $Ej(KX)$ is the Bochner integral of $j(KX)$. In particular, if $X$ is an integrable $K(\mathfrak X)$-valued random element, then $j(EX)=Ej(X)$.

Proof. First, observe that $j(KX)$ is a Borel-measurable random element in the separable Banach space $\mathbb E$ and $E\|j(KX)\|=Ed(\theta,KX)\le Ed(\theta,X)<\infty$, where $\theta$ is the element mentioned in the proof of Theorem 3.3. This ensures the existence of the Bochner integral of $j(KX)$. Next, Lemma 3.3 in [22] implies that $EX=E(KX)$, hence it is sufficient to prove $j(E(KX))=Ej(KX)$. This will be done by approximation with simple random elements. If $X$ is simple, i.e., $X=[I(\Omega_i),x_i]_{i=1}^n$, then

$$j(E(KX))=j([P(\Omega_i),Kx_i]_{i=1}^n)=\sum_{i=1}^nP(\Omega_i)j(Kx_i)=Ej(KX).$$

In the general case $X\in L^1_{\mathfrak X}$, there exists a sequence of simple random elements $\{X_n=\psi_n(X)\}_{n\ge1}$ such that $Ed(X_n,X)\to0$ and $EX_n\to EX$ as $n\to\infty$. Since the convexification operator $K$ is non-expansive with respect to the metric $d$, we have $d(E(KX_n),E(KX))\le Ed(KX_n,KX)\le Ed(X_n,X)\to0$. On the other hand, the continuity of the mappings $j$ and $K$ implies that $j(KX_n)\to j(KX)$; moreover,

$$\|j(KX_n)\|=d(KX_n,\theta)\le d(X_n,\theta)\le d(X_n,u_0)+d(u_0,\theta)\le2d(X,u_0)+d(u_0,\theta)\in L^1.$$

Applying the Lebesgue dominated convergence theorem in $\mathbb E$ and combining with the simple case above, we obtain

$$j(E(KX))=j\big(\lim_{n\to\infty}E(KX_n)\big)=\lim_{n\to\infty}j(E(KX_n))=\lim_{n\to\infty}Ej(KX_n)=Ej(KX).$$

The proof is completed. □

By Theorem 4.1, we immediately derive the following corollary. 

Corollary 4.2. 1) For $X_1,X_2\in L^1_{\mathfrak X}$, we have $E([\lambda_1,X_1;\lambda_2,X_2])=[\lambda_1,EX_1;\lambda_2,EX_2]$.

2) Suppose that $X\in L^1_{\mathfrak X}$ and $\xi$ is a real-valued random variable such that $0<\xi<1$ a.s. If $\xi$ and $X$ are independent, then $E([\xi,X;1-\xi,u])=[E\xi,EX;1-E\xi,Ku]$, $u\in\mathfrak X$.

3) Let $\xi$ be a real-valued random variable such that $0<\xi<1$ a.s. Then $E([\xi,u;1-\xi,v])=[E\xi,Ku;1-E\xi,Kv]$ for all $u,v\in\mathfrak X$.

Proof. Applying Theorem 4.1 and property (2.3), we have

$$j(E([\lambda_1,X_1;\lambda_2,X_2]))=Ej([\lambda_1,KX_1;\lambda_2,KX_2])=E(\lambda_1j(KX_1)+\lambda_2j(KX_2))$$

$$=\lambda_1j(EX_1)+\lambda_2j(EX_2)=j([\lambda_1,EX_1;\lambda_2,EX_2]).$$

$$j(E([\xi,X;1-\xi,u]))=Ej([\xi,KX;1-\xi,Ku])=E(\xi\,j(KX))+(1-E\xi)j(Ku)$$

$$=E\xi\,Ej(KX)+(1-E\xi)j(Ku)=j([E\xi,EX;1-E\xi,Ku]).$$

$$j(E([\xi,u;1-\xi,v]))=E(\xi\,j(Ku)+(1-\xi)j(Kv))=j([E\xi,Ku;1-E\xi,Kv]).$$

The proof is completed by the injectivity of $j$. Note that the conclusions of this corollary can also be proved directly by approximation with simple random elements. □

Consider a mapping $\varphi:\mathfrak X\to\mathbb R$. It will be called convex if $\varphi([\lambda_i,x_i]_{i=1}^n)\le\sum_{i=1}^n\lambda_i\varphi(x_i)$ for all $x_1,\dots,x_n\in\mathfrak X$ and $\lambda_1,\dots,\lambda_n\in(0;1)$ with $\sum_{i=1}^n\lambda_i=1$; it will be called midpoint convex if $\varphi([1/2,x;1/2,y])\le(\varphi(x)+\varphi(y))/2$ for every $x,y\in\mathfrak X$; it will be called lower semicontinuous if $\varphi(x)\le\liminf_n\varphi(x_n)$ whenever $x_n\to x$; it will be called affine if both $\varphi$ and $-\varphi$ are convex. If $\mathfrak X$ is convexifiable, then the notions of convex and affine mappings can be extended to weights $\lambda_1,\dots,\lambda_n\in[0;1]$. It is easy to see that if $f$ is affine, then so is $f+c$ for every $c\in\mathbb R$. Denote by $\mathfrak X'$ the set of all continuous affine mappings $f:\mathfrak X\to\mathbb R$.

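Lemma 4.3. If $\mathfrak X$ is convexifiable and has more than one element, then $\mathfrak X'$ separates the points of $\mathfrak X$. In other words, if $f(x)=f(y)$ for all $f\in\mathfrak X'$, then $x=y$.

Proof. Assume that there exist two elements $x,y\in\mathfrak X$ with $x\ne y$ such that $f(x)=f(y)$ for all $f\in\mathfrak X'$. Let $(\mathbb E,\|\cdot\|)$ be the Banach space with dual $\mathbb E^*$, and let $j:\mathfrak X\to\mathbb E\supset\mathbb F=j(\mathfrak X)$ be the mapping as in Theorem 3.3. Since $f$ is affine on $\mathfrak X$, $\tilde f=f\circ j^{-1}$ is also affine on $\mathbb F$, where $j^{-1}:\mathbb F\to\mathfrak X$ is the inverse mapping of $j$. We denote $\tilde{\mathfrak X}'=\{\tilde f=f\circ j^{-1}:\mathbb F\to\mathbb R,\ f\in\mathfrak X'\}$ and $\mathbb F^*=\{g|_{\mathbb F}:\mathbb F\to\mathbb R,\ g|_{\mathbb F}\text{ is the restriction of }g\in\mathbb E^*\text{ to }\mathbb F\}$. It is easy to see that $\mathbb F^*\subset\tilde{\mathfrak X}'$ and $\tilde{\mathfrak X}'\cong\mathfrak X'$ (the notation $A\cong B$ means that there exists a one-to-one correspondence $\kappa:A\to B$). It follows from $x\ne y$ that $j(x)\ne j(y)$, and by the Hahn-Banach separation theorem, there exists $h\in\mathbb E^*$ such that $h(j(x))\ne h(j(y))$. Moreover, since $j(x),j(y)\in\mathbb F$, we have $h|_{\mathbb F}(j(x))\ne h|_{\mathbb F}(j(y))$. Choosing $f=(h|_{\mathbb F})\circ j$, we obtain $f\in\mathfrak X'$ and $f(x)\ne f(y)$, which is a contradiction. This implies $x=y$, so $\mathfrak X'$ separates the points of $\mathfrak X$. □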

Remark 3. If $\mathfrak X$ is not convexifiable, then $\mathfrak X'$ does not separate the points of $\mathfrak X$ in general. Indeed, let $(\mathfrak X,\|\cdot\|)$ be a separable Banach space and denote by $d$ the metric associated with the norm $\|\cdot\|$. For $r>1$, we consider the operation ${}^r[\cdot,\cdot]$ on $\mathfrak X$ defined by ${}^r[\lambda_i,x_i]_{i=1}^n=\sum_{i=1}^n\lambda_i^rx_i$. As shown in Example 5 in [21], ${}^r[\cdot,\cdot]$ is a convex combination operation (the $r$-th power combination) on $(\mathfrak X,d)$, and the corresponding convexification operator satisfies $K_rx=0$ for all $x\in\mathfrak X$. This implies that $K_r(\mathfrak X)=\{0\}$ and $\mathfrak X$ is not convexifiable. For arbitrary $x\in\mathfrak X$ and $f\in\mathfrak X'$, we have $f({}^r[n^{-1},x]_{i=1}^n)=\sum_{i=1}^nn^{-1}f(x)=f(x)$ for all $n$. Letting $n\to\infty$ and using the continuity of $f$, we have $f(x)=f(K_rx)=f(0)$. This means that $f$ is a constant function on $\mathfrak X$, so $\mathfrak X'$ contains only constant functions (moreover $\mathfrak X'\cong\mathbb R$). Hence, $\mathfrak X'$ does not separate the points of $\mathfrak X$.
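
The collapse of the convexification operator in Remark 3 is easy to observe numerically. The sketch below works on the real line with $r=2$ and evaluates the $n$-fold equal-weight $r$-th power combination ${}^r[n^{-1},x]_{i=1}^n=n^{1-r}x$ for growing $n$. The function name is ours, and floating-point arithmetic is used, so the printed values are approximate.

```python
def rth_power_average(x, n, r):
    """r-th power combination of n copies of x with equal weights 1/n.

    Computes sum_{i=1}^n (1/n)^r * x, which equals n^(1-r) * x for a real number x.
    """
    return sum((1.0 / n) ** r * x for _ in range(n))

for n in (1, 10, 100, 1000):
    # For r = 2 the value behaves like x / n and tends to 0
    print(n, rth_power_average(5.0, n, r=2))
```

The values shrink towards 0 whatever the starting point, which is the numerical counterpart of $K_rx=0$.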



Theorem 4.4. Let $\mathfrak X$ be a convexifiable CC space and $X$ be an integrable $\mathfrak X$-valued random element. Then,

(i) $f(X)\in L^1$ for all $f\in\mathfrak X'$;

(ii) An element $m\in\mathfrak X$ is the expectation of $X$ if and only if $f(m)=Ef(X)$ for all $f\in\mathfrak X'$.

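Proof. Throughout this proof, we use the notations as in Theorem 3.3 and Lemma 4.3.

(i) We will prove that for each $f \in \mathcal{X}'$, there exists a constant $C$ such that $|f(x)| \leq C(d(0,x) + 1)$ for all $x \in \mathcal{X}$. To do this, it is sufficient to prove that for each $\tilde{f} \in \tilde{\mathcal{X}}'$, $|\tilde{f}(x)| \leq C(\|x\| + 1)$ for all $x \in F$. Assume to the contrary that the conclusion does not hold. Then there exists a sequence $\{x_n\}_{n \geq 1} \subset F$ such that $|\tilde{f}(x_n)| > n(\|x_n\| + 1)$ for all $n$. Since $0 < ((1 + \|x_n\|)n)^{-1} \leq 1$ for all $n \geq 1$ and $0 \in F$, the convexity of $F$ implies $\frac{x_n}{(1 + \|x_n\|)n} \in F$. Because $\tilde{f}$ is affine, we have

$$\tilde{f}\Big(\frac{x_n}{(1 + \|x_n\|)n}\Big) = \frac{1}{(1 + \|x_n\|)n}\,\tilde{f}(x_n) + \Big(1 - \frac{1}{(1 + \|x_n\|)n}\Big)\tilde{f}(0).$$

It follows that

$$\Big|\tilde{f}\Big(\frac{x_n}{(1 + \|x_n\|)n}\Big) - \Big(1 - \frac{1}{(1 + \|x_n\|)n}\Big)\tilde{f}(0)\Big| = \frac{|\tilde{f}(x_n)|}{(1 + \|x_n\|)n} > 1 \quad \text{for all } n. \qquad (4.1)$$

Since $\big\|\frac{x_n}{(1 + \|x_n\|)n}\big\| < n^{-1} \to 0$ as $n \to \infty$, the continuity of $\tilde{f}$ implies that the LHS of (4.1) tends to 0, which is a contradiction. Therefore, $|f(X)| \leq C(d(0,X) + 1)$, and this inequality implies $f(X) \in L^1$.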

(ii) Since $X \in L^1_{\mathcal{X}}$, conclusion (i) ensures the existence of $Ef(X)$ for all $f \in \mathcal{X}'$. The necessity part of (ii) is easy: it can be proved by approximation with simple random elements, so we omit the proof. We now prove the sufficiency part. Assume that $f(m) = Ef(X)$ for all $f \in \mathcal{X}'$. The necessity part implies that $f(m) = f(EX)$ for all $f \in \mathcal{X}'$. If $\mathcal{X}$ has one element, then $EX = m$ obviously. If $\mathcal{X}$ has more than one element, then applying Lemma 4.3, we obtain $m = EX$. □

Note that for $f \in \mathcal{X}'$,

$$f(Kx) = f\Big(\lim_{n \to \infty} [n^{-1}, x]_{i=1}^n\Big) = \lim_{n \to \infty} \sum_{i=1}^n n^{-1} f(x) = f(x)$$

for all $x \in \mathcal{X}$. Hence, the following corollary is obtained immediately from Theorem 4.4.

Corollary 4.5. Let $\mathcal{X}$ be a CC space and $X$ be an integrable $\mathcal{X}$-valued random element. Then, $f(X) = f(KX) \in L^1$ for all $f \in \mathcal{X}' \subset (K(\mathcal{X}))'$, and an element $m \in K(\mathcal{X})$ is the expectation of $X$ if and only if $f(m) = Ef(KX)$ for all $f \in (K(\mathcal{X}))'$.

Proposition 4.6. ([22], Theorem 3.1) Let $\varphi : \mathcal{X} \to \mathbb{R}$ be midpoint convex and lower semicontinuous, and let $X$ be an integrable $\mathcal{X}$-valued random element. Then $\varphi(EX) \leq E\varphi(X)$ whenever $\varphi(X)$ is integrable.

Proof. This proposition establishes Jensen’s inequality in CC space and is a main result of Terán [22], where it was proved by using the SLLN. Besides the approach of Terán, we present here another method that combines the embedding theorem with a corresponding version of Jensen’s inequality in Banach space. First, we prove that if $\varphi : \mathcal{X} \to \mathbb{R}$ is midpoint convex and lower semicontinuous, then $\varphi(Kx) \leq \varphi(x)$ for $x \in \mathcal{X}$. Indeed, since $[n^{-1}, x]_{i=1}^n \to Kx$, the subsequence $\{[2^{-m}, x]_{i=1}^{2^m}\}_{m \geq 1}$ tends to $Kx$ as $m \to \infty$. Applying the first part of the proof of Proposition 5.4 (given in the next section), we have

$$\varphi(Kx) = \varphi\Big(\lim_{m \to \infty} [2^{-m}, x]_{i=1}^{2^m}\Big) \leq \liminf_{m \to \infty} \varphi\big([2^{-m}, x]_{i=1}^{2^m}\big) \leq \liminf_{m \to \infty} 2^{-m} \sum_{i=1}^{2^m} \varphi(x) = \varphi(x).$$

This implies $\varphi(KX) \leq \varphi(X)$, and in particular $\varphi^+(KX) \leq \varphi^+(X)$, where $\varphi^+ = \max\{0, \varphi\}$. We now consider two cases as follows:

Case 1. $\varphi(KX)$ is integrable. This implies that $E\varphi(KX)$ is finite and $E\varphi(KX) \leq E\varphi(X)$. With $j^{-1} : F \to K(\mathcal{X})$, putting $\tilde{\varphi} = \varphi \circ j^{-1} : F \to \mathbb{R}$, we derive

$$\tilde{\varphi}(x/2 + y/2) = \varphi\big([1/2, j^{-1}(x); 1/2, j^{-1}(y)]\big) \leq \big(\varphi \circ j^{-1}(x) + \varphi \circ j^{-1}(y)\big)/2 = \big(\tilde{\varphi}(x) + \tilde{\varphi}(y)\big)/2$$

for all $x, y \in F$, which means that $\tilde{\varphi}$ is midpoint convex on $F$. Since $\varphi$ is lower semicontinuous on $\mathcal{X}$ and $j$ is isometric, $\tilde{\varphi}$ is lower semicontinuous on $F$. Thus $\tilde{\varphi}$ is midpoint convex as well as lower semicontinuous on $F$, which implies that $\tilde{\varphi}$ is convex on $F$. Applying Jensen’s inequality ([13], Theorem 3.10(ii)), we get $\tilde{\varphi}(E(j(KX))) \leq E\tilde{\varphi}(j(KX))$. On the other hand, Theorem 4.1 implies that $\tilde{\varphi}(j(E(KX))) = \tilde{\varphi}(E(j(KX)))$, and this is equivalent to $\varphi(E(KX)) = \tilde{\varphi}(E(j(KX)))$. Combining the arguments above, we obtain $\varphi(EX) = \varphi(E(KX)) \leq E\varphi(KX) \leq E\varphi(X)$.

Case 2. $\varphi(KX)$ is not integrable. Putting $\varphi_n = \max\{-n, \varphi\}$, $n = 1, 2, \ldots$, we have $\varphi_n \downarrow \varphi$, and $\varphi_n(KX)$ is integrable for each $n$ thanks to $\varphi^+ \geq \varphi_n \geq -n$. It is not hard to check that $\{\varphi^+, \varphi_n, n \geq 1\}$ is also a collection of lower semicontinuous and midpoint convex functions on $\mathcal{X}$. According to Case 1, we obtain $\varphi_n(EX) \leq E\varphi_n(X)$ for all $n$. Taking $n \to \infty$ and using the monotone convergence theorem, we derive $\varphi(EX) \leq E\varphi(X)$. This completes the proof. □

4.2 On notion of conditional expectation 

The notion of conditional expectation of a random element taking values in concrete metric spaces has been introduced by several authors in various ways. For example, Herer [11] constructed this notion in finitely compact metric spaces with nonnegative curvature. Sturm [19] dealt with the problem in global NPC spaces, where conditional expectation was defined as a minimizer of the “variance”. Other definitions can be found in [4, 10, 16]. In this part, we discuss the notion of conditional expectation in a CC space $\mathcal{X}$ and stress that all results presented below extend the corresponding ones in Banach space. We construct this notion by the traditional approximation method.

Let $X \in L^1_{\mathcal{X}}$. If $X = [I_{(X = x_i)}, x_i]_{i=1}^n$ is simple, then the conditional expectation of $X$ relative to a $\sigma$-algebra $\mathcal{G}$ is defined by $E(X|\mathcal{G}) = [E(I_{(X = x_i)}|\mathcal{G}), Kx_i]_{i=1}^n$ (A). With this definition, the reader may naturally wonder why we do not use another form of conditional expectation, such as $E(X|\mathcal{G}) = [E(I_{(X = x_i)}|\mathcal{G}), x_i]_{i=1}^n$ (B). The reason is that definition (B) does not extend the notion of expectation when $\mathcal{G} = \{\emptyset, \Omega\}$, and, more profoundly, that (B) depends on the representation of $X$ while (A) does not (see property (2.5)). Hence, definition (A) is more suitable than (B).

From definition (A) above, we can prove with some simple calculations that if $X$ and $Y$ are simple random elements, then $d(E(X|\mathcal{G}), E(Y|\mathcal{G})) \leq E(d(X,Y)|\mathcal{G})$ a.s., where $\mathcal{G}$ is some sub-$\sigma$-algebra of $\mathcal{F}$. We now consider the general case. Let $X$ be an integrable random element, i.e., $X \in L^1_{\mathcal{X}}$. The conditional expectation of $X$ is defined (up to a null set) by $E(X|\mathcal{G}) = \lim_{n \to \infty} E(\psi_n(X)|\mathcal{G})$ a.s., where the mapping $\psi_n$ was mentioned in the first part of Section 4. Note that the limit in the RHS exists by completeness. It is easy to see from the above definition that if $X \in L^1_{\mathcal{X}}$ then $E(X|\mathcal{G}) \in L^1_{\mathcal{X}}$. Moreover, by applying the approximation method and the Lebesgue dominated convergence theorem for conditional expectation in $\mathbb{R}$, we also find $d(E(X|\mathcal{G}), E(Y|\mathcal{G})) \leq E(d(X,Y)|\mathcal{G})$ for $X, Y \in L^1_{\mathcal{X}}$, and in particular, $\|E(X|\mathcal{G})\|_a \leq E(\|X\|_a|\mathcal{G})$, $a \in K(\mathcal{X})$.

First, we establish the Lebesgue dominated convergence theorem for conditional expectation in CC space.

Proposition 4.7. Let $X_n, X$ be integrable $\mathcal{X}$-valued random elements. Assume that the following hold:

(i) $d(X_n, X) \to 0$ a.s. as $n \to \infty$,

(ii) there exist a function $f \in L^1_{\mathbb{R}}$ and some $a \in \mathcal{X}$ such that $\|X_n\|_a \leq f$ a.s. for all $n$.

Then $d(E(X_n|\mathcal{G}), E(X|\mathcal{G})) \to 0$ a.s. as $n \to \infty$.

Proof. By the triangle inequality, $d(X_n, X) \leq \|X_n\|_a + \|X\|_a \leq f + \|X\|_a$. Since $\|X\|_a + f \in L^1_{\mathbb{R}}$, it follows from the Lebesgue dominated convergence theorem for conditional expectation in $\mathbb{R}$ that

$$\lim_{n \to \infty} d(E(X_n|\mathcal{G}), E(X|\mathcal{G})) \leq \lim_{n \to \infty} E(d(X_n, X)|\mathcal{G}) = E\Big(\lim_{n \to \infty} d(X_n, X)\Big|\mathcal{G}\Big) = 0 \quad \text{a.s.}$$

The proof is completed. □

Theorem 4.8. Let $X$ be an integrable $\mathcal{X}$-valued random element. Then, $j(E(X|\mathcal{G})) = j(E(KX|\mathcal{G})) = E(j(KX)|\mathcal{G})$ a.s., where $j : K(\mathcal{X}) \to E$ is the mapping presented in Theorem 3.3.

Proof. As mentioned in Theorem 4.1, $j(KX)$ is a random element in $E$ and $j(KX) \in L^1_E$. Hence, the conditional expectation $E(j(KX)|\mathcal{G})$ exists; moreover, $E(j(KX)|\mathcal{G}) \in j(K(\mathcal{X}))$ a.s. First, if $X = [I_{(X = x_i)}, x_i]_{i=1}^n$ is simple, then by the definition of conditional expectation and the idempotence of $K$,

$$E(KX|\mathcal{G}) = E\big([I_{(X = x_i)}, Kx_i]_{i=1}^n \big| \mathcal{G}\big) = [E(I_{(X = x_i)}|\mathcal{G}), KKx_i]_{i=1}^n = E(X|\mathcal{G}) \quad \text{a.s.},$$

$$j(E(KX|\mathcal{G})) = j\big([E(I_{(X = x_i)}|\mathcal{G}), Kx_i]_{i=1}^n\big) = \sum_{i=1}^n E(I_{(X = x_i)}|\mathcal{G})\, j(Kx_i) = E(j(KX)|\mathcal{G}) \quad \text{a.s.}$$

Next, if $X \in L^1_{\mathcal{X}}$, then there exists a sequence $\{X_n, n \geq 1\}$ of simple random elements such that $X_n \to X$, $\|X_n\|_{u_0} \leq 2\|X\|_{u_0}$, and $E(X_n|\mathcal{G}) \to E(X|\mathcal{G})$ a.s. Applying Proposition 4.7, we obtain $E(KX_n|\mathcal{G}) \to E(KX|\mathcal{G})$ a.s. Moreover, it follows from the previous case that $E(KX_n|\mathcal{G}) = E(X_n|\mathcal{G})$ for all $n$, and the uniqueness of the limit implies $E(KX|\mathcal{G}) = E(X|\mathcal{G})$ a.s. Since $j$ is continuous,

$$j(E(KX|\mathcal{G})) = j\Big(\lim_{n \to \infty} E(KX_n|\mathcal{G})\Big) = \lim_{n \to \infty} j(E(KX_n|\mathcal{G})) = \lim_{n \to \infty} E(j(KX_n)|\mathcal{G}) = E(j(KX)|\mathcal{G}) \quad \text{a.s.},$$

where the last limit holds due to Lebesgue’s dominated convergence theorem for conditional expectation in Banach space. The proof is completed. □

It is well known that the definition of conditional expectation $E(X|\mathcal{G})$ via the approximation method in a separable Banach space $E$ is equivalent to the following result: “For $X \in L^1_E$, $Y = E(X|\mathcal{G})$ if and only if $Y \in L^1_E(\mathcal{G})$ and $E(X I_A) = E(Y I_A)$ for all $A \in \mathcal{G}$”. The same equivalence in CC space is established in the following result, and its proof is based on the embedding theorem.

Theorem 4.9. Let $X \in L^1_{\mathcal{X}}$ and $a \in K(\mathcal{X})$. Then $Y = E(X|\mathcal{G})$ if and only if $Y \in L^1_{K(\mathcal{X})}(\mathcal{G})$ and $E([I_A, X; I_{\bar{A}}, a]) = E([I_A, Y; I_{\bar{A}}, a])$ for all $A \in \mathcal{G}$.

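Proof. Necessity: If $Y = E(X|\mathcal{G})$, then $Y \in L^1_{K(\mathcal{X})}(\mathcal{G})$ obviously. For $A \in \mathcal{G}$, by Theorem 4.1 and Theorem 4.8,

$$\begin{aligned}
j(E([I_A, Y; I_{\bar{A}}, a])) &= E(I_A j(Y) + I_{\bar{A}} j(a)) = E(I_A j(E(X|\mathcal{G})) + I_{\bar{A}} j(a)) = E(I_A E(j(KX)|\mathcal{G}) + I_{\bar{A}} j(a)) \\
&= E(E(I_A j(KX)|\mathcal{G}) + I_{\bar{A}} j(a)) = E(I_A j(KX) + I_{\bar{A}} j(a)) = j(E([I_A, X; I_{\bar{A}}, a])).
\end{aligned}$$

The injectivity of $j$ implies $E([I_A, X; I_{\bar{A}}, a]) = E([I_A, Y; I_{\bar{A}}, a])$.

Sufficiency: Assume that there exists $Y \in L^1_{K(\mathcal{X})}(\mathcal{G})$ such that $E([I_A, X; I_{\bar{A}}, a]) = E([I_A, Y; I_{\bar{A}}, a])$ for all $A \in \mathcal{G}$. We need to prove that $Y = E(X|\mathcal{G})$. Observe that the conditional expectation $E(X|\mathcal{G})$ exists because $X \in L^1_{\mathcal{X}}$. By the hypothesis, we have $j(E([I_A, X; I_{\bar{A}}, a])) = j(E([I_A, Y; I_{\bar{A}}, a]))$ for all $A \in \mathcal{G}$, which is equivalent to $E(I_A j(KX)) = E(I_A j(Y))$ for all $A \in \mathcal{G}$. It is obvious that $j(Y)$ is $\mathcal{G}$-measurable and integrable, so $j(Y) = E(j(KX)|\mathcal{G})$. On the other hand, $E(j(KX)|\mathcal{G}) = j(E(X|\mathcal{G}))$ by Theorem 4.8. Thus $j(Y) = j(E(X|\mathcal{G}))$, and it follows that $Y = E(X|\mathcal{G})$. □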

The proposition below will give some basic properties of conditional expectation. The proof is easy thanks to 
Theorem 4.1 and Theorem 4.8. 

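Proposition 4.10. Let $X, Y \in L^1_{\mathcal{X}}$. Then, the following hold for $\omega \in \Omega$ a.s.:

1) $E(E(X|\mathcal{G})) = EX$.

2) If $\sigma(X)$ and $\mathcal{G}$ are independent, then $E(X|\mathcal{G}) = EX$.

3) If $X$ is $\mathcal{G}$-measurable, then $E(X|\mathcal{G}) = KX$.

4) If $\xi$ is a real-valued random variable with $0 < \xi < 1$ and $\xi$ is $\mathcal{G}$-measurable, then

$$E([\xi, X; 1 - \xi, Y]|\mathcal{G}) = [\xi, E(X|\mathcal{G}); 1 - \xi, E(Y|\mathcal{G})].$$

In particular, $E([\lambda, X; 1 - \lambda, Y]|\mathcal{G}) = [\lambda, E(X|\mathcal{G}); 1 - \lambda, E(Y|\mathcal{G})]$ for $\lambda \in (0; 1)$.

5) If $\mathcal{G}_1, \mathcal{G}_2$ are two $\sigma$-algebras and $\mathcal{G}_1 \subset \mathcal{G}_2$, then $E(E(X|\mathcal{G}_1)|\mathcal{G}_2) = E(E(X|\mathcal{G}_2)|\mathcal{G}_1) = E(X|\mathcal{G}_1)$.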

The Jensen inequality for conditional expectation in CC space will be given in the following proposition. Note here 
that this result does not totally extend Proposition 4.6. 

Proposition 4.11. Let $\varphi : \mathcal{X} \to \mathbb{R}$ be a midpoint convex and continuous function, let $\mathcal{G} \subset \mathcal{F}$ be a sub-$\sigma$-algebra, and let $X \in L^1_{\mathcal{X}}$ be such that $\varphi(X) \in L^1_{\mathbb{R}}$. Then $\varphi(E(X|\mathcal{G})) \leq E(\varphi(X)|\mathcal{G})$ a.s.

Proof. Combining Jensen’s inequality for Banach-valued conditional expectation (e.g., see Theorem in [24]) with the embedding Theorem 3.3 and following the same scheme as in the proof of Proposition 4.6, we obtain the conclusion. □

According to Theorem 4.4(i) and Proposition 4.11, we immediately derive the following corollary. 

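Corollary 4.12. 1) If $X \in L^p_{\mathcal{X}}$, then $\|E(X|\mathcal{G})\|_a^p \leq E(\|X\|_a^p|\mathcal{G})$ a.s., for arbitrary $a \in K(\mathcal{X})$, $p \geq 1$.

2) If $X \in L^1_{\mathcal{X}}$, then $f(E(X|\mathcal{G})) = E(f(X)|\mathcal{G}) = E(f(KX)|\mathcal{G})$ a.s. for all $f \in \mathcal{X}'$.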
Similar to Banach space, the notion of martingale in CC space can be defined as follows. Let $\{X_n, n \geq 1\} \subset L^1_{\mathcal{X}}$ and let $\{\mathcal{G}_n, n \geq 1\}$ be an increasing sequence of sub-$\sigma$-algebras of $\mathcal{F}$. The collection $\{X_n, \mathcal{G}_n, n \geq 1\}$ is said to be a martingale if $X_n$ is $\mathcal{G}_n$-measurable and $E(X_{n+1}|\mathcal{G}_n) = X_n$ a.s. for all $n \geq 1$. Thanks to Corollary 4.12(1), it is easy to verify that if $\{X_n, \mathcal{G}_n, n \geq 1\}$ is a martingale, then $\{\|X_n\|_a^p, \mathcal{G}_n, n \geq 1\}$ is a real-valued submartingale for arbitrary $a \in \mathcal{X}$, $p \geq 1$. The convergence of martingales is established in the proposition below.

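Proposition 4.13. (i) Let $\{\mathcal{G}_n, n \geq 1\}$ be an increasing sequence of sub-$\sigma$-algebras of $\mathcal{F}$ and let $\mathcal{G}_\infty = \sigma\big(\bigcup_{n \geq 1} \mathcal{G}_n\big)$. If $X \in L^p_{\mathcal{X}}$ with some $p \geq 1$, then $E(X|\mathcal{G}_n) \to E(X|\mathcal{G}_\infty)$ a.s. and in $L^p$ as $n \to \infty$.

(ii) Let $\{\mathcal{H}_n, n \geq 1\}$ be a decreasing sequence of sub-$\sigma$-algebras of $\mathcal{F}$ and let $\mathcal{H}_\infty = \bigcap_{n \geq 1} \mathcal{H}_n$. If $X \in L^p_{\mathcal{X}}$ with some $p \geq 1$, then $E(X|\mathcal{H}_n) \to E(X|\mathcal{H}_\infty)$ a.s. and in $L^p$ as $n \to \infty$.

Proof. Under the hypotheses in (i) and (ii), $\{E(X|\mathcal{G}_n), \mathcal{G}_n, n \geq 1\}$ is a martingale and $\{E(X|\mathcal{H}_n), \mathcal{H}_n, n \geq 1\}$ is an inverse martingale, respectively. Combining convergence theorems for Banach space-valued martingales (e.g., see Pisier [14], Theorem 1.5 and Theorem 1.14 for conclusion (i); Theorem in [14], Ch.I, Section 1.5 for conclusion (ii)) with the embedding Theorem 3.3, we immediately obtain the proof. □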


As the last result in this section, we establish a version of Birkhoff’s ergodic theorem in CC space. Let $\tau : \Omega \to \Omega$ be an $\mathcal{F}$-measurable transformation. The transformation $\tau$ is measure-preserving or, equivalently, $P$ is said to be a $\tau$-invariant measure, if $P(\tau^{-1}(A)) = P(A)$ for all $A \in \mathcal{F}$. A set $A \in \mathcal{F}$ satisfying $\tau^{-1}(A) = A$ is said to be a $\tau$-invariant set, and the family of all $\tau$-invariant sets constitutes a sub-$\sigma$-algebra $\mathcal{I}_\tau$ of $\mathcal{F}$. We say that $\tau$ is ergodic if $\mathcal{I}_\tau$ is trivial, i.e., $P(A) = 0$ or $P(A) = 1$ whenever $A \in \mathcal{I}_\tau$.

Theorem 4.14. Let $\tau$ be a measure-preserving transformation of the probability space $(\Omega, \mathcal{F}, P)$ and $\mathcal{I}_\tau$ be the $\sigma$-algebra of invariant events with respect to $\tau$. If $X \in L^1_{\mathcal{X}}$, then $[n^{-1}, X \circ \tau^i]_{i=0}^{n-1} \to E(X|\mathcal{I}_\tau)$ a.s. as $n \to \infty$.

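Proof. Recall that in Théorème 3.1 in [16], Raynaud de Fitte proved a version of the ergodic theorem in metric space by approximation with discrete range random elements. To prove our result, we present here another technique based on the embedding theorem. Since $X$ is integrable, Theorem 3.2 in [17] implies that for each natural number $m$ there exists a compact subset $\mathcal{K}_m$ of $\mathcal{X}$ such that $E(d(X,u) I(X \notin \mathcal{K}_m)) < 1/m$, and without loss of generality we can assume that $\mathcal{K}_m \subset \mathcal{K}_{m+1}$ for all $m$. For each $n, m \geq 1$, defining $Y_{m,n-1} = X \circ \tau^{n-1}$ if $X \circ \tau^{n-1} \in \mathcal{K}_m$ and $Y_{m,n-1} = u$ if $X \circ \tau^{n-1} \notin \mathcal{K}_m$, we have

$$\begin{aligned}
d\big([n^{-1}, X \circ \tau^i]_{i=0}^{n-1}, E(X|\mathcal{I}_\tau)\big) \leq{} & d\big([n^{-1}, X \circ \tau^i]_{i=0}^{n-1}, [n^{-1}, Y_{m,i}]_{i=0}^{n-1}\big) + d\big([n^{-1}, Y_{m,i}]_{i=0}^{n-1}, [n^{-1}, KY_{m,i}]_{i=0}^{n-1}\big) \\
& + d\big([n^{-1}, KY_{m,i}]_{i=0}^{n-1}, [n^{-1}, KX \circ \tau^i]_{i=0}^{n-1}\big) + d\big([n^{-1}, KX \circ \tau^i]_{i=0}^{n-1}, E(X|\mathcal{I}_\tau)\big). \qquad (4.2)
\end{aligned}$$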


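We estimate the four parts in the RHS of inequality (4.2) as follows. First, since $\mathcal{K}_m \cup \{u\}$ is compact and $Y_{m,n} \in \mathcal{K}_m \cup \{u\}$ for each $m$, Proposition 5.5 (given in the next section) implies $d\big([n^{-1}, Y_{m,i}]_{i=0}^{n-1}, [n^{-1}, KY_{m,i}]_{i=0}^{n-1}\big) \to 0$ as $n \to \infty$. Second, according to properties (2.3), (2.6) and the definition of $Y_{m,n}$, we obtain

$$\begin{aligned}
d\big([n^{-1}, KY_{m,i}]_{i=0}^{n-1}, [n^{-1}, KX \circ \tau^i]_{i=0}^{n-1}\big) &\leq d\big([n^{-1}, Y_{m,i}]_{i=0}^{n-1}, [n^{-1}, X \circ \tau^i]_{i=0}^{n-1}\big) \\
&\leq n^{-1} \sum_{i=0}^{n-1} d(Y_{m,i}, X \circ \tau^i) = n^{-1} \sum_{i=0}^{n-1} d(X \circ \tau^i, u) I(X \circ \tau^i \notin \mathcal{K}_m) = n^{-1} \sum_{i=0}^{n-1} \big(d(X,u) I(X \notin \mathcal{K}_m)\big) \circ \tau^i.
\end{aligned}$$

For each $m$, applying the classic Birkhoff ergodic theorem to the real-valued random variable $d(X,u) I(X \notin \mathcal{K}_m)$, we derive

$$n^{-1} \sum_{i=0}^{n-1} \big(d(X,u) I(X \notin \mathcal{K}_m)\big) \circ \tau^i \to E\big(d(X,u) I(X \notin \mathcal{K}_m) \big| \mathcal{I}_\tau\big) \quad \text{a.s. as } n \to \infty.$$

Next, applying Theorem 3.3,

$$d\big([n^{-1}, KX \circ \tau^i]_{i=0}^{n-1}, E(X|\mathcal{I}_\tau)\big) = \Big\| n^{-1} \sum_{i=0}^{n-1} j(KX \circ \tau^i) - j(E(X|\mathcal{I}_\tau)) \Big\| = \Big\| n^{-1} \sum_{i=0}^{n-1} j(KX) \circ \tau^i - E(j(KX)|\mathcal{I}_\tau) \Big\| \to 0$$

a.s. as $n \to \infty$, where the convergence comes from Birkhoff’s ergodic theorem for the Banach-valued random element $j(KX)$ ([17], Ch.VI, Theorem 9.4). Combining the above arguments, we obtain

$$\limsup_{n \to \infty} d\big([n^{-1}, X \circ \tau^i]_{i=0}^{n-1}, E(X|\mathcal{I}_\tau)\big) \leq 2E\big(d(X,u) I(X \notin \mathcal{K}_m) \big| \mathcal{I}_\tau\big) \quad \text{a.s. for all } m.$$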
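Finally, to get the conclusion of the theorem, it is sufficient to prove that $E(d(X,u) I(X \notin \mathcal{K}_m)|\mathcal{I}_\tau) \to 0$ a.s. as $m \to \infty$. Observe that $\{E(d(X,u) I(X \notin \mathcal{K}_m)|\mathcal{I}_\tau), m \geq 1\}$ is a non-increasing sequence, so the almost sure convergence is equivalent to the convergence in probability. For arbitrary $\varepsilon > 0$,

$$P\big(|E(d(X,u) I(X \notin \mathcal{K}_m)|\mathcal{I}_\tau)| > \varepsilon\big) \leq \varepsilon^{-1} E\big(d(X,u) I(X \notin \mathcal{K}_m)\big) \leq (\varepsilon m)^{-1} \to 0 \quad \text{as } m \to \infty,$$

and this completes the proof of the theorem. □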
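The real-valued Birkhoff theorem invoked in the proof above can be observed numerically in a standard textbook setting. The following is a minimal sketch under stated assumptions, not part of the paper's argument: it uses the rotation $\omega \mapsto \omega + \alpha \pmod 1$ on $[0,1)$ with an irrational $\alpha$ (a classical measure-preserving, ergodic transformation for Lebesgue measure), and the names `rotate`, `birkhoff_average` and `observable` are hypothetical. Because the rotation is ergodic, the invariant $\sigma$-algebra is trivial and the time averages of an observable should approach its integral.

```python
import math

# Irrational rotation angle (golden-ratio conjugate); an illustrative choice.
ALPHA = (math.sqrt(5) - 1) / 2


def rotate(omega):
    """One step of the rotation omega -> omega + ALPHA (mod 1)."""
    return (omega + ALPHA) % 1.0


def birkhoff_average(f, omega, n):
    """Time average n^{-1} * sum_{i=0}^{n-1} f(tau^i(omega))."""
    total = 0.0
    for _ in range(n):
        total += f(omega)
        omega = rotate(omega)
    return total / n


def observable(omega):
    """A continuous observable whose integral over [0, 1) equals 0."""
    return math.cos(2 * math.pi * omega)


if __name__ == "__main__":
    for n in (10, 1000, 20000):
        # The averages should drift toward the space average, which is 0 here.
        print(n, birkhoff_average(observable, 0.1, n))
```

In the CC-space theorem the same averaging is applied to $j(KX)$ in the Banach space $E$; the scalar case shown here only illustrates the mechanism.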


5 Miscellaneous applications and remarks 

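Proposition 5.1. If $\mathcal{X}$ is a complete CC space, then $K_{k(\mathcal{X})}(k(\mathcal{X})) = ck(K_{\mathcal{X}}(\mathcal{X}))$. So the CC space $(ck(K_{\mathcal{X}}(\mathcal{X})), D_{\mathcal{X}})$ can be embedded isometrically into a Banach space such that the convex combination structure is preserved.

Proof. For $A \in K_{k(\mathcal{X})}(k(\mathcal{X}))$, there exists $B \in k(\mathcal{X})$ such that $A = K_{k(\mathcal{X})}B = \overline{\mathrm{co}}\,K_{\mathcal{X}}(B)$. It follows from the continuity of $K_{\mathcal{X}}$ that $K_{\mathcal{X}}(B) \in k(K_{\mathcal{X}}(\mathcal{X}))$, so $\overline{\mathrm{co}}\,K_{\mathcal{X}}(B)$ is a compact and convex subset of $K_{\mathcal{X}}(\mathcal{X})$. This means $A = \overline{\mathrm{co}}\,K_{\mathcal{X}}(B) \in ck(K_{\mathcal{X}}(\mathcal{X}))$, thus $K_{k(\mathcal{X})}(k(\mathcal{X})) \subset ck(K_{\mathcal{X}}(\mathcal{X}))$. The reverse inclusion is easy to obtain thanks to the observation that $A = \overline{\mathrm{co}}\,K_{\mathcal{X}}(A) = K_{k(\mathcal{X})}A$ for $A \in ck(K_{\mathcal{X}}(\mathcal{X}))$. □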

Lemma 3.3 in [18] established an inequality in CC space and it is a useful tool to obtain many limit theorems (see 
[18, 23]). Now by applying Theorem 3.3, this lemma may be proved more easily as follows: 

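Proposition 5.2. ([18], Lemma 3.3) Let $\{a_i, b_i, 1 \leq i \leq n\} \subset [0,1]$ be a collection of nonnegative constants with $\sum_{i=1}^n a_i = \sum_{i=1}^n b_i = 1$. Then $d\big([a_i, Kx_i]_{i=1}^n, [b_i, Kx_i]_{i=1}^n\big) \leq \sum_{i=1}^n |a_i - b_i|\, d(x_i, u)$, where $x_1, \ldots, x_n, u \in \mathcal{X}$ are arbitrary.

Proof. With the notations as in Theorem 3.3, we have

$$\begin{aligned}
d\big([a_i, Kx_i]_{i=1}^n, [b_i, Kx_i]_{i=1}^n\big) &= \Big\| \sum_{i=1}^n a_i j(Kx_i) - \sum_{i=1}^n b_i j(Kx_i) \Big\| = \Big\| \sum_{i=1}^n (a_i - b_i) j(Kx_i) \Big\| \\
&= \Big\| \sum_{i=1}^n (a_i - b_i)\big(j(Kx_i) - j(Ku)\big) \Big\| \leq \sum_{i=1}^n |a_i - b_i|\, \|j(Kx_i) - j(Ku)\| \\
&= \sum_{i=1}^n |a_i - b_i|\, d(Kx_i, Ku) \leq \sum_{i=1}^n |a_i - b_i|\, d(x_i, u),
\end{aligned}$$

where the third equality uses $\sum_{i=1}^n (a_i - b_i) = 0$ and the last estimate follows from property (2.6). □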
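The norm estimate at the heart of this proof, $\|\sum_i (a_i - b_i) v_i\| \leq \sum_i |a_i - b_i|\, \|v_i - w\|$ whenever $\sum_i a_i = \sum_i b_i$, can be spot-checked numerically. The sketch below is only an illustration under assumptions: it works in the Euclidean plane as a stand-in for the Banach space $E$, the vectors play the role of $j(Kx_i)$ and $j(Ku)$, and all helper names are hypothetical.

```python
import math
import random


def norm(v):
    """Euclidean norm of a 2-vector."""
    return math.hypot(v[0], v[1])


def weighted_sum(weights, points):
    """Linear combination sum_i w_i * p_i of 2-vectors."""
    return (
        sum(w * p[0] for w, p in zip(weights, points)),
        sum(w * p[1] for w, p in zip(weights, points)),
    )


def random_weights(n, rng):
    """Positive weights normalised to sum to 1."""
    raw = [rng.random() + 1e-9 for _ in range(n)]
    total = sum(raw)
    return [w / total for w in raw]


def check_once(n, rng):
    """Return (lhs, rhs) of the inequality for one random configuration."""
    a = random_weights(n, rng)
    b = random_weights(n, rng)
    points = [(rng.uniform(-5, 5), rng.uniform(-5, 5)) for _ in range(n)]
    u = (rng.uniform(-5, 5), rng.uniform(-5, 5))
    sa = weighted_sum(a, points)
    sb = weighted_sum(b, points)
    lhs = norm((sa[0] - sb[0], sa[1] - sb[1]))
    rhs = sum(
        abs(ai - bi) * norm((p[0] - u[0], p[1] - u[1]))
        for ai, bi, p in zip(a, b, points)
    )
    return lhs, rhs


if __name__ == "__main__":
    rng = random.Random(0)
    print(all(lhs <= rhs + 1e-9 for lhs, rhs in (check_once(6, rng) for _ in range(200))))
```

The reference point `u` is arbitrary because the weight differences sum to zero, which is exactly the step that lets the proof subtract $j(Ku)$ from every term.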

Remark 4. The inequality

$$d\big([a_i, x_i]_{i=1}^n, [b_i, x_i]_{i=1}^n\big) \leq \sum_{i=1}^n |a_i - b_i|\, d(x_i, u) \qquad (5.1)$$

does not hold in general for $x_1, \ldots, x_n \in \mathcal{X}$. This is shown by the following example.

Example 1. Let $(\mathcal{X}, \|\cdot\|)$ be a Banach space and consider the operation ${}^2[\lambda_i, x_i]_{i=1}^n = \sum_{i=1}^n \lambda_i^2 x_i$. As shown in Example 5 of [21], $(\mathcal{X}, \|\cdot\|, {}^2[\cdot,\cdot])$ is a CC space. For $0 \neq x, y \in \mathcal{X}$, we have

$$d\big({}^2[4/5, x; 1/5, y], {}^2[2/5, x; 3/5, y]\big) = \|(16x/25 + y/25) - (4x/25 + 9y/25)\| = \|12x/25 - 8y/25\|.$$

Choosing $y = -x/2$, we get $\|12x/25 - 8y/25\| = 16\|x\|/25$. On the other hand, $|4/5 - 2/5| \cdot \|x\| + |1/5 - 3/5| \cdot \|y\| = 3\|x\|/5 < 16\|x\|/25$, so (5.1) fails with $u = 0$.
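The arithmetic of Example 1 can be reproduced exactly with rational numbers. The following sketch assumes the real line as the Banach space and takes $x = 1$, $y = -1/2$, $u = 0$; the helper name `square_combination` is hypothetical and simply implements the squared-weight operation described in the example.

```python
from fractions import Fraction as Fr


def square_combination(weights, points):
    """Squared-weight combination on the real line: sum of w**2 * x."""
    return sum(w ** 2 * x for w, x in zip(weights, points))


x = Fr(1)
y = -x / 2
a = [Fr(4, 5), Fr(1, 5)]
b = [Fr(2, 5), Fr(3, 5)]

# Left-hand side of (5.1): distance between the two combinations.
lhs = abs(square_combination(a, [x, y]) - square_combination(b, [x, y]))
# Right-hand side of (5.1) with u = 0: sum of |a_i - b_i| * |x_i|.
rhs = sum(abs(ai - bi) * abs(p) for ai, bi, p in zip(a, b, [x, y]))

print(lhs, rhs, lhs > rhs)  # 16/25 3/5 True
```

Since $16/25 > 15/25$, the left-hand side exceeds the right-hand side, in agreement with the example.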

The result below is the Etemadi SLLN in CC space and it was proved in [21] via approximation method by simple 
random elements. However, a different proof can be obtained by combining Etemadi’s SLLN in Banach space ([7], 
Remark 2) with embedding Theorem 3.3 and using simultaneously the same scheme as in proof of Theorem 4.14. 

Proposition 5.3. ([21], Theorem 5.1) Let $\{X, X_n, n \geq 1\}$ be a sequence of pairwise i.i.d. $\mathcal{X}$-valued random elements. Then, $[n^{-1}, X_i]_{i=1}^n \to EX$ a.s. as $n \to \infty$.
The following proposition presents a special form of Jensen’s inequality, and it plays an important role in establishing the general case. This inequality can be proved easily by combining Theorem 3.3 and a corresponding version in Banach space; moreover, it was proved directly by Terán ([22], Lemma 3.2). However, in the proof below we give another direct argument, which seems simpler than the one of Terán [22].

Proposition 5.4. Let $\varphi : \mathcal{X} \to \mathbb{R}$ be a midpoint convex function and $\{x_i\}_{i=1}^n \subset K(\mathcal{X})$ be a sequence of convex points of $\mathcal{X}$. If $\{q_i\}_{i=1}^n$ is a sequence of positive rational numbers with $\sum_{i=1}^n q_i = 1$, then $\varphi([q_i, x_i]_{i=1}^n) \leq \sum_{i=1}^n q_i \varphi(x_i)$. Furthermore, if $\varphi$ is lower semicontinuous, then $\varphi([r_i, x_i]_{i=1}^n) \leq \sum_{i=1}^n r_i \varphi(x_i)$, where $r_i > 0$, $\sum_{i=1}^n r_i = 1$.

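Proof. In the first case, we prove the inequality above when $q_i = 1/n$, $i = 1, \ldots, n$. Namely, we now prove that

$$\varphi\big([n^{-1}, x_i]_{i=1}^n\big) \leq \frac{1}{n} \sum_{i=1}^n \varphi(x_i). \qquad (5.2)$$

The proof of (5.2) is by induction on $n$. If $n = 2$, (5.2) holds clearly by the definition of a midpoint convex function. Suppose that (5.2) holds for $n = 2^k$ ($k \in \mathbb{N}$); we will prove that (5.2) also holds with $n = 2^{k+1}$. Indeed, for $\{x_1, x_2, \ldots, x_{2^{k+1}}\} \subset K(\mathcal{X})$, we obtain

$$\begin{aligned}
\varphi\big([2^{-(k+1)}, x_i]_{i=1}^{2^{k+1}}\big) &= \varphi\Big(\big[1/2, [2^{-k}, x_i]_{i=1}^{2^k}; 1/2, [2^{-k}, x_i]_{i=2^k+1}^{2^{k+1}}\big]\Big) \\
&\leq \frac{1}{2} \varphi\big([2^{-k}, x_i]_{i=1}^{2^k}\big) + \frac{1}{2} \varphi\big([2^{-k}, x_i]_{i=2^k+1}^{2^{k+1}}\big) \\
&\leq \frac{1}{2} \cdot \frac{1}{2^k} \sum_{i=1}^{2^k} \varphi(x_i) + \frac{1}{2} \cdot \frac{1}{2^k} \sum_{i=2^k+1}^{2^{k+1}} \varphi(x_i) = \frac{1}{2^{k+1}} \sum_{i=1}^{2^{k+1}} \varphi(x_i).
\end{aligned}$$

Therefore, inequality (5.2) holds for all $n = 2^k$ ($k \in \mathbb{N}$). Moreover, when $n$ has the form $2^k$, (5.2) holds not only for $\{x_i\} \subset K(\mathcal{X})$ but also for $\{x_i\} \subset \mathcal{X}$. In the next step, we prove that if (5.2) is satisfied for some $n > 2$, then it is also satisfied for $n - 1$. Let $\{x_1, x_2, \ldots, x_{n-1}\} \subset K(\mathcal{X})$ and denote $x_n = [(n-1)^{-1}, x_i]_{i=1}^{n-1} \in K(\mathcal{X})$. It follows from properties (CC.i), (CC.ii), (2.5) and the induction hypothesis that

$$\begin{aligned}
\varphi\big([(n-1)^{-1}, x_i]_{i=1}^{n-1}\big) &= \varphi\Big(\big[n^{-1}, x_1; n^{-1}, x_2; \ldots; n^{-1}, x_{n-1}; n^{-1}, [(n-1)^{-1}, x_i]_{i=1}^{n-1}\big]\Big) \\
&= \varphi\big([n^{-1}, x_i]_{i=1}^n\big) \leq \frac{1}{n} \sum_{i=1}^n \varphi(x_i) = \frac{1}{n} \sum_{i=1}^{n-1} \varphi(x_i) + \frac{1}{n} \varphi\big([(n-1)^{-1}, x_i]_{i=1}^{n-1}\big).
\end{aligned}$$

This implies that

$$\varphi\big([(n-1)^{-1}, x_i]_{i=1}^{n-1}\big) \leq \frac{1}{n-1} \sum_{i=1}^{n-1} \varphi(x_i).$$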

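In the second case, each $q_i$ is rational, so it can be expressed as $q_i = k_i/m$, where $m, k_i$ are natural numbers for all $i = 1, \ldots, n$. Then, we have

$$\begin{aligned}
\varphi\big([q_i, x_i]_{i=1}^n\big) &= \varphi\Big(\big[\underbrace{m^{-1}, x_1; \ldots; m^{-1}, x_1}_{k_1 \text{ times}}; \ldots; \underbrace{m^{-1}, x_n; \ldots; m^{-1}, x_n}_{k_n \text{ times}}\big]\Big) \quad \text{(by (2.5))} \\
&\leq \frac{k_1}{m} \varphi(x_1) + \cdots + \frac{k_n}{m} \varphi(x_n) = \sum_{i=1}^n q_i \varphi(x_i) \quad \text{(by (5.2))}.
\end{aligned}$$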

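For the remaining conclusion, let $\varphi$ be lower semicontinuous and $r_i > 0$. Each positive real number $r_i$ is the limit of some increasing sequence of positive rational numbers $\{q_{ij}\}_{j \geq 1}$. Thus, by the continuity of the convex combination operation, we obtain

$$\begin{aligned}
\varphi\big([r_i, x_i]_{i=1}^n\big) &= \varphi\Big(\lim_{j \to \infty} \big[q_{1j}, x_1; \ldots; q_{nj}, x_n; 1 - (q_{1j} + \cdots + q_{nj}), a\big]\Big) \quad \text{(for some } a \in K(\mathcal{X})) \\
&\leq \liminf_{j \to \infty} \varphi\big(\big[q_{1j}, x_1; \ldots; q_{nj}, x_n; 1 - (q_{1j} + \cdots + q_{nj}), a\big]\big) \\
&\leq \liminf_{j \to \infty} \big(q_{1j} \varphi(x_1) + \cdots + q_{nj} \varphi(x_n) + (1 - (q_{1j} + \cdots + q_{nj})) \varphi(a)\big) \quad \text{(by the second case)} \\
&= \lim_{j \to \infty} \big(q_{1j} \varphi(x_1) + \cdots + q_{nj} \varphi(x_n) + (1 - (q_{1j} + \cdots + q_{nj})) \varphi(a)\big) \\
&= r_1 \varphi(x_1) + \cdots + r_n \varphi(x_n).
\end{aligned}$$

Combining the above arguments, the proposition is proved. □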
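The reduction used in the second case of the proof, rewriting rational weights $q_i = k_i/m$ as $m$ equal weights with $x_i$ repeated $k_i$ times, can be sketched as follows. This is an illustration under assumptions only: the points are real numbers, so the convex combination is the ordinary weighted sum, $\varphi(t) = t^2$ is one convenient convex function, and the helper name `expand_to_equal_weights` is hypothetical.

```python
from fractions import Fraction
from math import lcm


def expand_to_equal_weights(weights, points):
    """Rewrite rational weights k_i/m as m equal weights 1/m.

    Returns (m, expanded), where `expanded` lists each point x_i repeated
    k_i times and m is the common denominator of the weights.
    """
    m = lcm(*(w.denominator for w in weights))
    expanded = []
    for w, x in zip(weights, points):
        expanded.extend([x] * int(w * m))
    return m, expanded


def phi(t):
    """A convex test function on the real line."""
    return t * t


weights = [Fraction(1, 2), Fraction(1, 3), Fraction(1, 6)]
points = [Fraction(0), Fraction(3), Fraction(6)]
m, expanded = expand_to_equal_weights(weights, points)

combination = sum(w * x for w, x in zip(weights, points))   # weighted form
equal_form = sum(expanded) / m                              # equal-weight form
jensen_rhs = sum(w * phi(x) for w, x in zip(weights, points))

print(m, combination == equal_form, phi(combination) <= jensen_rhs)
```

Both forms of the combination coincide, so inequality (5.2) for $m$ equal weights yields the inequality for the rational weights, as in the proof.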

Since the embedding theorem is only available for the convexifiable domain $K(\mathcal{X})$ while initial conditions are usually imposed on the CC space $\mathcal{X}$, it is necessary to compare quantities in $\mathcal{X}$ with their counterparts in $K(\mathcal{X})$ obtained after applying the convexification operator. The following proposition is such a result.

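Proposition 5.5. Let $\mathcal{K}$ be a compact subset of $\mathcal{X}$ and $\{x_n, n \geq 1\} \subset \mathcal{K}$. Then, $d\big([n^{-1}, x_i]_{i=1}^n, [n^{-1}, Kx_i]_{i=1}^n\big) \to 0$ as $n \to \infty$.

Proof. For arbitrary $\varepsilon > 0$, there exists a finite collection $\{t_1, \ldots, t_m\}$ of elements of $\mathcal{K}$ such that $\mathcal{K} \subset \bigcup_{l=1}^m B(t_l, \varepsilon)$, where $B(u, r) = \{x \in \mathcal{X} : d(u, x) < r\}$. Denote $A_1 = \mathcal{K} \cap B(t_1, \varepsilon)$, ..., $A_l = \mathcal{K} \cap B(t_l, \varepsilon) \cap \big(\bigcup_{i=1}^{l-1} A_i\big)^c$, $l = 2, \ldots, m$. For each $n$, let us define $y_n = t_l$ if $x_n \in A_l$, so $d(x_n, y_n) < \varepsilon$ for all $n$. By the triangle inequality and (2.6),

$$\begin{aligned}
d\big([n^{-1}, x_i]_{i=1}^n, [n^{-1}, Kx_i]_{i=1}^n\big) \leq{} & d\big([n^{-1}, x_i]_{i=1}^n, [n^{-1}, y_i]_{i=1}^n\big) + d\big([n^{-1}, y_i]_{i=1}^n, [n^{-1}, Ky_i]_{i=1}^n\big) \\
& + d\big([n^{-1}, Ky_i]_{i=1}^n, [n^{-1}, Kx_i]_{i=1}^n\big) \\
\leq{} & 2n^{-1} \sum_{i=1}^n d(x_i, y_i) + d\big([n^{-1}, y_i]_{i=1}^n, [n^{-1}, Ky_i]_{i=1}^n\big) < 2\varepsilon + (I_n).
\end{aligned}$$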

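We now show that $(I_n) = d\big([n^{-1}, y_i]_{i=1}^n, [n^{-1}, Ky_i]_{i=1}^n\big) \to 0$ as $n \to \infty$. For each $l = 1, \ldots, m$, put

$$z_{l,n} = \mathrm{card}\{1 \leq i \leq n : y_i = t_l\}, \quad \text{and} \quad \mathcal{Z}_n = \{l : 1 \leq l \leq m,\ z_{l,n} > 0\}, \quad n \geq 1.$$

Then, $\{z_{l,n}, n \geq 1\}$ is a non-decreasing sequence for each $l$. By (CC.i) and property (2.1), we obtain

$$(I_n) = d\Big(\big[n^{-1} z_{l,n}, [z_{l,n}^{-1}, t_l]_{i=1}^{z_{l,n}}\big]_{l \in \mathcal{Z}_n}, \big[n^{-1} z_{l,n}, Kt_l\big]_{l \in \mathcal{Z}_n}\Big) \leq \sum_{l \in \mathcal{Z}_n} \frac{z_{l,n}}{n}\, d\big([z_{l,n}^{-1}, t_l]_{i=1}^{z_{l,n}}, Kt_l\big).$$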

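For each $l = 1, \ldots, m$, we have $\lim_{n \to \infty} d\big([n^{-1}, t_l]_{i=1}^n, Kt_l\big) = 0$ by the definition of $K$. Thus, there exists $n_{\varepsilon,m} \in \mathbb{N}$ such that for all $n \geq n_{\varepsilon,m}$ and for all $l = 1, \ldots, m$,

$$d\big([n^{-1}, t_l]_{i=1}^n, Kt_l\big) < \frac{\varepsilon}{m}. \qquad (5.3)$$

We put

$$N_{l,\varepsilon,m} = \max_{1 \leq k < n_{\varepsilon,m}} d\big([k^{-1}, t_l]_{i=1}^k, Kt_l\big), \qquad N_{\varepsilon,m} = \max_{1 \leq l \leq m} N_{l,\varepsilon,m},$$

and choose the smallest integer $n'_{\varepsilon,m}$ such that $n'_{\varepsilon,m} \geq \varepsilon^{-1} m N_{\varepsilon,m} n_{\varepsilon,m}$. Now, for $n \geq n'_{\varepsilon,m}$:

- If $z_{l,n} \geq n_{\varepsilon,m}$, then it follows from (5.3) and $n^{-1} z_{l,n} \leq 1$ that

$$\frac{z_{l,n}}{n}\, d\big([z_{l,n}^{-1}, t_l]_{i=1}^{z_{l,n}}, Kt_l\big) < \frac{\varepsilon}{m}.$$

- If $0 < z_{l,n} < n_{\varepsilon,m}$, then

$$\frac{z_{l,n}}{n}\, d\big([z_{l,n}^{-1}, t_l]_{i=1}^{z_{l,n}}, Kt_l\big) \leq \frac{n_{\varepsilon,m} N_{\varepsilon,m}}{n} \leq \frac{\varepsilon}{m}.$$

Hence, for all $n \geq n'_{\varepsilon,m}$ and all $l \in \mathcal{Z}_n$,

$$\frac{z_{l,n}}{n}\, d\big([z_{l,n}^{-1}, t_l]_{i=1}^{z_{l,n}}, Kt_l\big) \leq \frac{\varepsilon}{m}.$$

This implies that

$$(I_n) \leq \sum_{l \in \mathcal{Z}_n} \frac{z_{l,n}}{n}\, d\big([z_{l,n}^{-1}, t_l]_{i=1}^{z_{l,n}}, Kt_l\big) \leq \varepsilon$$

for all $n \geq n'_{\varepsilon,m}$, so $(I_n) \to 0$ as $n \to \infty$. The proof is completed. □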